arXiv:1501.04288v1 [astro-ph.CO] 18 Jan 2015


Physics of the cosmic microwave background anisotropy* 


Martin Bucher 

Laboratoire APC, Université Paris 7/CNRS
Bâtiment Condorcet, Case 7020
75205 Paris Cedex 13, France 
bucher@apc.univ-paris7.fr and 
Astrophysics and Cosmology Research Unit 
School of Mathematics, Statistics and Computer Science 
University of KwaZulu-Natal 
Durban 4041, South Africa 

January 20, 2015 


Abstract 

Observations of the cosmic microwave background (CMB), especially of its frequency 
spectrum and its anisotropies, both in temperature and in polarization, have played a 
key role in the development of modern cosmology and our understanding of the very 
early universe. We review the underlying physics of the CMB and how the primordial 
temperature and polarization anisotropies were imprinted. Possibilities for distinguishing
competing cosmological models are emphasized. The current status of CMB experiments
and experimental techniques with an emphasis toward future observations,
particularly in polarization, is reviewed. The physics of foreground emissions, especially 
of polarized dust, is discussed in detail, since this area is likely to become crucial for 
measurements of the B modes of the CMB polarization at ever greater sensitivity. 


1 This article is to be published also in the book “One Hundred Years of General Relativity: From Genesis 
and Empirical Foundations to Gravitational Waves, Cosmology and Quantum Gravity,” edited by Wei-Tou 
Ni (World Scientific, Singapore, 2015) as well as in Int. J. Mod. Phys. D (in press). 

Figure 1: Horn antenna used in 1964 by Penzias and Wilson to discover the CMB.
(Credit: NASA image)

1 Observing the Microwave Sky: A Short History and 
Observational Overview 

In their 1965 landmark paper, A. Penzias and R. Wilson [157], who were investigating the
origin of radio interference at what at the time were considered high frequencies, reported a
3.5 K signal from the sky at 4 GHz that was “isotropic, unpolarized, and free from seasonal
variations” within the limits of their observations. Their apparatus was a 20 foot horn directed
at the zenith coupled to a maser amplifier and a radiometer (see Fig. 1). The maser amplifier
and radiometer were switched between the sky and a liquid helium cooled reference source 
used for comparison. Alternative explanations such as ground pick-up from the sidelobes of 
their antenna were ruled out in their analysis, and they noted that known radio sources would 
contribute negligibly at this frequency because their apparent temperature falls rapidly with 
frequency. 

In a companion paper published in the same issue of the Astrophysical Journal, Dicke
et al. [46] proposed the explanation that the isotropic sky signal seen by Penzias and Wilson
was in fact emanating from a hot big bang, as had been suggested in the 1948 paper of Alpher
et al. [7], which predicted a blackbody photon component with a temperature of
approximately a few K. That prediction was based on considering the conditions required for
successful nucleosynthesis in an expanding universe, that is, to create an appreciable fraction
of primordial helium from the neutrons that are decaying as the universe is expanding.

The importance of this discovery became almost immediately apparent, and others set out 
to better characterize this excess emission, which later would become known as the Cosmic 
Microwave Background (CMB), or sometimes in the older literature the Cosmic Microwave 
Background Radiation (CMBR). The two principal questions were: (i) To what extent is 
this background isotropic? (ii) How close is the spectrum to a perfect Planckian blackbody 
spectrum? 

The main obstacle to answering these questions was the lack of adequate instrumentation,
and this was the main reason why the first detection of the CMB anisotropy had to
wait until 1992, when the COBE team announced their observation of a statistically
significant anisotropy of primordial origin after the dipole due to our motion with respect to the
CMB had been subtracted [211] (see Figs. 2 and 3). The COBE satellite, in a low-Earth
orbit, carried three instruments: the differential microwave radiometer (DMR) [210], the
far infrared absolute spectrophotometer (FIRAS) [136], and the diffuse infrared background
experiment (DIRBE). By today’s standards, the measurement of the cosmic microwave
background anisotropy was crude. The angular resolution was low: the width of the beam was
7° (FWHM), and the sky map used for the analysis was smoothed to 10° to suppress beam
artefacts. The contribution from instrument noise was large by contemporary standards. The
COBE noise was 43, 16, and 22 $\mathrm{mK\,Hz^{-1/2}}$ for the 31, 53, and 90 GHz channels, respectively
[21].^a Nevertheless, COBE did provide a convincing first detection of the CMB anisotropy,
and most importantly established the overall level of the primordial cosmological density
perturbations, which played a crucial role in determining the viability of the cosmological models
in vogue at that time, which were much more numerous and varied than today.

The COBE detection was followed by an intense effort to characterize the CMB anisotropy 
at greater sensitivity and on smaller angular scales. There were numerous experiments from 
the ground at locations where the column density of water in the atmosphere is particularly 
low, such as Saskatoon, the Atacama Desert in Chile, the South Pole, and the Canary Islands, 
as well as from stratospheric balloons. Figure 4 shows the state of play about four years
after COBE, with two competing theoretical models plotted together with the data points
available at that time. In Sec. 6 we shall present the physics of the CMB systematically, but
jumping ahead a bit, we give here a few words of explanation for understanding this plot. In 
most theoretical models, the CMB is generated by an isotropic Gaussian stochastic process, 
in which case all the available information concerning the underlying theoretical model can 
be extracted by measuring the angular power spectrum of the CMB anisotropies. Because 
of isotropy, one may expand the map in spherical harmonics to extract its angular power 
spectrum, defined as 


$$ C_\ell^{TT,\mathrm{obs}} = \frac{1}{(2\ell+1)} \sum_{m=-\ell}^{+\ell} \left| a^{T}_{\ell m} \right|^2 . \tag{1} $$


It is customary to plot the quantity $\ell(\ell+1)C_\ell/(2\pi)$, which would be constant for a scale

^a By comparison, for WMAP the detector sensitivities were 0.7, 0.7, 0.9, 1.1, and 1.5 $\mathrm{mK\,Hz^{-1/2}}$ for the K,
Ka, Q, V, and W bands, respectively [1 96].

Figure 2: The microwave sky as seen by the COBE DMR (differential microwave
radiometer) instrument. The top panel shows the microwave sky as seen on a linear
temperature scale including zero. No anisotropies are visible in this image, because the CMB
monopole at 2.725 K dominates. In the middle panel, the monopole component has been
subtracted. Apart from some slight contamination from the galaxy near the equator
(corresponding to the plane of our Galaxy), one sees only a nearly perfect dipole pattern, owing to
our peculiar motion with respect to the rest frame defined by the CMB. In the bottom panel,
both the monopole and dipole components have been removed. Except for the galactic
contamination around the equator, one sees the cosmic microwave background anisotropy along
with some noise. (Credit: NASA/COBE Science Team)

Figure 3: COBE DMR individual frequency maps (with monopole and dipole
components removed). Taking data at several frequencies is key to proving the primordial
origin of the signal and removing galactic contaminants. Here are the three frequency maps
from the COBE DMR observations. (Credit: NASA/COBE Science Team)

Figure 4: State of CMB observations in 1998. After the COBE DMR detection at very
large angular scales at the so-called Sachs-Wolfe plateau, numerous groups sought to discover
the acoustic peaks, or to find their absence. This compilation from 1998 shows the state of
play at that time. While an unmistakable rise toward the first acoustic peak is apparent, it
is not so apparent what happens toward higher $\ell$. Two theories broadly compatible with the
data at that time are plotted, one with $\Omega_k = 0$ and another with $\Omega_k \approx 0.7$, which predicts
a higher amplitude of the primordial perturbations and an acoustic peak shifted to smaller
angular scales, as explained in Sec. 7.3. (Credit: Max Tegmark)

Figure 5: Boomerang observation of acoustic oscillations. The $C_\ell^{TT}$ power spectrum
as measured by Boomerang is shown on the left without a fit to a theoretical model and
on the right with the theoretical predictions for a spatially flat cosmological model with an
exactly scale invariant primordial power spectrum for the adiabatic growing mode. (Credit:
Boomerang Collaboration)

invariant pattern on the sky.^b This is the quantity plotted in Fig. 4, where the question posed
at the time was whether there is a rise to a first acoustic peak followed by several decaying 
secondary acoustic peaks, as indicated by the solid theoretical curves. This structure is a 
prediction of simple inflationary models—or more precisely, of models where only the adiabatic 
growing mode is excited with an approximately scale invariant primordial spectrum. In this 
plot, one sees fairly convincing evidence for a rise in the angular power, but the continuation 
of the curve is unclear. 
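
As an illustration of Eq. (1) and of the plotted quantity $\ell(\ell+1)C_\ell/(2\pi)$, the following is a minimal sketch, not the pipeline of any experiment. The function names `estimate_cl` and `draw_alm`, the storage convention (only $m \ge 0$ is kept for a real-valued map), and the input spectrum (a flat band power of 1000 $\mu\mathrm{K}^2$) are assumptions made purely for this example.

```python
import numpy as np

def estimate_cl(alm_by_ell):
    """Estimate C_ell as in Eq. (1) from harmonic coefficients.

    alm_by_ell[ell] holds the complex a_{ell m} for m = 0..ell of a real map.
    Coefficients with m < 0 have the same modulus as those with m > 0,
    so each m > 0 term is counted twice (storage convention assumed here).
    """
    cl = np.zeros(len(alm_by_ell))
    for ell, alm in enumerate(alm_by_ell):
        power = np.abs(alm[0]) ** 2 + 2.0 * np.sum(np.abs(alm[1:]) ** 2)
        cl[ell] = power / (2 * ell + 1)
    return cl

def draw_alm(cl, rng):
    """Draw isotropic Gaussian a_{ell m} (m >= 0) with <|a_{ell m}|^2> = C_ell."""
    alms = []
    for ell, c in enumerate(cl):
        a = np.empty(ell + 1, dtype=complex)
        a[0] = rng.normal(scale=np.sqrt(c))            # m = 0 is purely real
        re = rng.normal(scale=np.sqrt(c / 2), size=ell)
        im = rng.normal(scale=np.sqrt(c / 2), size=ell)
        a[1:] = re + 1j * im
        alms.append(a)
    return alms

# Hypothetical scale invariant input: ell(ell+1)C_ell/(2 pi) = 1000 muK^2 for ell >= 2.
ell = np.arange(0, 501)
cl_true = np.zeros_like(ell, dtype=float)
cl_true[2:] = 2 * np.pi * 1000.0 / (ell[2:] * (ell[2:] + 1.0))

rng = np.random.default_rng(0)
cl_obs = estimate_cl(draw_alm(cl_true, rng))

# Band power as customarily plotted.
d_true = ell * (ell + 1) * cl_true / (2 * np.pi)
d_obs = ell * (ell + 1) * cl_obs / (2 * np.pi)
```

For a single sky realization, `d_obs` scatters around the flat `d_true`, with the scatter largest at low $\ell$ where only $2\ell+1$ coefficients are averaged.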

Several experiments contributed to providing the first clear detection of the acoustic
oscillations, namely TOCO [138], MAXIMA [75], and Boomerang [1431 fl5]. Figure 5 shows the
power spectrum as measured by one of these experiments, namely Boomerang, where a series
of well defined acoustic oscillations is clearly visible.

In the meantime, parallel efforts were underway in Europe and in the United States to 
prepare for another CMB space mission to follow on COBE at higher sensitivity and angular 
resolution. The COBE beam, which was 7° FWHM, did not use a telescope but rather 
microwave feed horns pointed directly at the sky. While not allowing for a high angular 
resolution, the feed horns had the advantage of producing well defined beams with rapidly 
falling sidelobes. The US NASA WMAP satellite, launched in 2001, delivered its one-year data
release in 2003 (including TT and TE) [235]-[243], and its first polarization data (including also
EE) in 2006 [244]-[247] (see Figs. 6 and 7). WMAP continued taking data for nine years, and
released installments of papers based on the five-, seven-, and nine-year data releases, in which
the results were further refined, benefiting from longer integration time (which nominally
would shrink error bars in proportion to $1/\sqrt{t_{\mathrm{obs}}}$) as well as improved instrument modeling

^b Strictly speaking, scale invariance on the sky can be defined only asymptotically in the $\ell \to \infty$ limit,
because the curvature of the celestial sphere breaks scale invariance. The use of $\ell(\ell+1)$ rather than $\ell^2$ here
is an historical convention.

Figure 6: WMAP single temperature frequency maps. WMAP observed in five
frequency bands. (Credit: NASA/WMAP Science Team)

Figure 7: WMAP single frequency polarization maps. The lines (which may be thought
of as double-headed vectors) show the amplitude and orientation of the linear polarization of
the measured CMB anisotropy in the indicated bands. The amplitude $P = \sqrt{Q^2 + U^2}$ is also
shown using the indicated color scale. (Credit: NASA/WMAP Science Team)


[249]-[260]. WMAP used horns pointed at a 1.4 m × 1.6 m off-axis Gregorian mirror to obtain
diffraction limited beams (although the mirror was under-illuminated somewhat to reduce far
sidelobes). The 20 detectors were based on coherent amplification using HEMTs (high electron
mobility transistors), the state of the art in coherent low-noise amplification at the time.
The coherent detection technology used by WMAP had the advantage that the electronics
could be passively cooled. The competing incoherent bolometric detection technology offers
superior sensitivity, operating almost at the quantum noise limit of the incident photons,
but requires cooling to approximately 100 mK. The cryogenic system required was judged
to present substantial risk for a space mission at the time, and this was likely one of the
reasons why WMAP was selected over one of its competitor missions, which was similar to
the European Planck HFI proposal.








The successor to WMAP was the European Space Agency (ESA) Planck satellite, which
was launched in May 2009 [159] [185]. Planck consisted of two instruments: (i) a low-frequency
instrument with three channels (at 30, 44 and 70 GHz) using a coherent detection technology,
and (ii) a high-frequency instrument using cryogenically-cooled bolometric detectors observing
in six bands (100, 143, 217, 353, 545 and 857 GHz). All bands except the highest two bands
were polarization sensitive and had a diffraction limited angular resolution, starting at 33
arcmin for the 30 GHz channel and going down to 5.5 arcmin for the 217 GHz channel. (The
top three channels have an angular resolution of approximately 5 arcmin.) Planck HFI took data
from August 2009 to January 2012, when the coolant for its high-frequency instrument ran
out. The Planck Collaboration reported its first results for cosmology in March 2013 based on
its temperature anisotropy data. The first results for cosmology using the polarization data
collected by Planck are expected in late 2014.

Figure 8 shows the Planck full-sky maps of the intensity of the microwave emission in nine
frequency bands ranging from 30 GHz to 857 GHz. Figure 9 shows a cleaned full-sky map
where a linear combination of the single band maps has been taken in order to isolate the
primordial cosmic microwave background signal. While the fluctuations in the cleaned map
in Fig. 9 do not appear to single out any particular direction in the sky and appear consistent
with an isotropic Gaussian random process, the maps in Fig. 8 show a clear excess in the
galactic plane. These full-sky maps use a Mollweide projection in galactic coordinates, with
the galactic center at the center of the projection. Even though the galactic contamination
depends largely on the angle to the galactic plane, with a lesser tendency to increase toward
the galactic center, considerable structure can be seen in the galactic emission over a broad
range of angular scales. The central bands include the least amount of galactic contamination,
which visibly is much larger for the lowest and the highest frequencies shown.

The CMB dipole amplitude is $\Delta T = 3.365 \pm 0.027$ mK and directed toward $(l, b) = (264.4° \pm
0.3°, 48.4° \pm 0.5°)$ in galactic coordinates. The maps have been processed to remove the
2.725 K CMB monopole component as well as the smaller CMB dipole having a peak-to-peak
amplitude of approximately 6.73 mK, so that only the spherical harmonic multipoles with
$\ell \ge 2$ are included. CMB angular power is typically expressed in terms of $\mathcal{D}_\ell = \ell(\ell+1)C_\ell/2\pi$,
which in the flat sky approximation corresponds to rms power (in $\mu\mathrm{K}^2$) per unit logarithmic
interval in the spatial frequency. A scale invariant temperature spectrum on the celestial
sphere would correspond to $\ell^2 C_\ell$ being constant. For comparison we give here a few ballpark
numbers characterizing the strength of the primordial CMB temperature anisotropy. Figure 10
plots the CMB power spectrum as observed by Planck together with the predictions of the
fit to a six-parameter theoretical model, which we will discuss further below. The magnitude
at low $\ell$, before the rise to the first acoustic, or Doppler, peak at $\ell \approx 220$, is $\mathcal{D}_\ell \approx 10^3\,\mu\mathrm{K}^2$,
which would correspond to an rms temperature in the neighborhood of 30 $\mu$K. The rise to the
first acoustic peak increases the power by about a factor of six, and then after approximately
five oscillations, damping effects take over, making the oscillations less apparent and causing
the spectrum to suffer a quasi-exponential decay.
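
The ballpark numbers quoted above can be checked with a few lines of arithmetic. This is a sketch only: the variable names are invented for the example, and the conversion of the dipole into a velocity assumes the first-order Doppler relation $\Delta T/T = v/c$, which the text does not state explicitly.

```python
import math

T_CMB_K = 2.725             # monopole temperature quoted in the text
DIPOLE_AMP_K = 3.365e-3     # dipole amplitude quoted in the text
D_ELL_PLATEAU = 1.0e3       # muK^2, ballpark low-ell band power from the text
C_LIGHT_KM_S = 299792.458

# Peak-to-peak dipole amplitude is twice the amplitude.
peak_to_peak_mK = 2.0 * DIPOLE_AMP_K * 1e3

# rms temperature corresponding to the low-ell band power.
rms_plateau_muK = math.sqrt(D_ELL_PLATEAU)

# The first acoustic peak carries about six times the plateau power.
rms_peak_muK = math.sqrt(6.0 * D_ELL_PLATEAU)

# Assumption of this sketch: first-order Doppler relation Delta T / T = v / c.
velocity_km_s = C_LIGHT_KM_S * DIPOLE_AMP_K / T_CMB_K

print(f"dipole peak-to-peak: {peak_to_peak_mK:.2f} mK")
print(f"rms at plateau:      {rms_plateau_muK:.1f} muK")
print(f"rms at first peak:   {rms_peak_muK:.1f} muK")
print(f"implied velocity:    {velocity_km_s:.0f} km/s")
```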

Although the detectors (to the extent that their response is ideal or linear) measure
the intensity expressed as the “spectral radiance” or “specific intensity” $I_\nu$ (having units
of erg s$^{-1}$ cm$^{-2}$ Hz$^{-1}$ sr$^{-1}$) averaged over a frequency band defined by the detector, it is
convenient to re-express these intensities in terms of an effective temperature. There are two
types of effective temperature: the Rayleigh-Jeans (R-J) temperature and the thermodynamic
temperature. It is important to keep in mind the distinction, especially at high frequencies [in

Figure 8: Planck single frequency temperature maps. The ESA Planck satellite
surveyed the sky in nine broad [i.e., $(\Delta\nu)/\nu \approx 0.3$] frequency bands, centered at
30, 44, 70, 100, 143, 217, 353, 545, and 857 GHz, shown in galactic coordinates. The units are
CMB thermodynamic temperature [see Eq. (3)]. The nonlinear scale avoids saturation in
regions of high galactic emission. (Credit: ESA/Planck Collaboration)

Figure 9: Planck internal linear combination map. A linear combination of the Planck
single frequency maps (shown in Fig. 8) is taken. This linear combination is optimized to filter
out any unwanted contaminants based on their differing frequency dependence. The success
of this procedure to remove the galactic contaminants is evident from the disappearance of
excess power along the equator corresponding to the galactic plane. (For more details, see
Ref. 161.) (Credit: ESA/Planck Collaboration)


this context $\nu \gtrsim \nu_{\mathrm{CMB}} = h^{-1}(k_B T_{\mathrm{CMB}}) = 57$ GHz], where the factor relating the two becomes
large.

In the R-J limit, where $(h\nu/k_B T) \ll 1$, the blackbody spectral radiance may be
approximated as $I_\nu = B_\nu(T) \approx 2(\nu/c)^2 (k_B T)$, as one would obtain classically from the density of
states of the radiation field and assigning an energy $(k_B T)$ to each harmonic oscillator degree
of freedom. If we invert, assuming the R-J limit above (regardless of whether this limit is
valid), we obtain the following definition for the R-J temperature


$$ T_{\mathrm{R\text{-}J}}(\nu) = \frac{1}{2k_B}\left(\frac{c}{\nu}\right)^2 I_\nu . \tag{2} $$

The “thermodynamic” temperature corresponding to a specific intensity $I_\nu$ at a certain
frequency, on the other hand, is obtained by inverting the unapproximated blackbody expression
$I_\nu = B_\nu(T) = (2h\nu^3/c^2)[\exp(h\nu/k_B T) - 1]^{-1}$. For the CMB, where the variations about the
average CMB temperature are small (i.e., $\Delta T/T \approx 10^{-5}$), linearized perturbation theory is
valid and one can invert this equation, obtaining


$$ \delta T_{\mathrm{CMB}} = \frac{(e^x - 1)^2}{x^2 e^x}\, \delta T_{\mathrm{R\text{-}J}} , \tag{3} $$


where $x = h\nu/k_B T$. This factor is approximately one for $x \lesssim 1$, but rises exponentially for
$x \gtrsim 1$.
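
To make the size of the conversion factor in Eq. (3) concrete, here is a minimal sketch that evaluates it at the nine Planck band centers listed in the text. The function names are hypothetical, the bands are treated as monochromatic (no integration over the bandpass), and $T_{\mathrm{CMB}} = 2.725$ K is taken from the text.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
K_BOLTZ = 1.380649e-23      # J / K
T_CMB = 2.725               # K, monopole temperature quoted in the text

def x_of_nu(nu_ghz, t_cmb=T_CMB):
    """Dimensionless frequency x = h nu / (k_B T)."""
    return H_PLANCK * nu_ghz * 1e9 / (K_BOLTZ * t_cmb)

def rj_to_thermo_factor(nu_ghz, t_cmb=T_CMB):
    """Factor (e^x - 1)^2 / (x^2 e^x) multiplying delta T_RJ in Eq. (3)."""
    x = x_of_nu(nu_ghz, t_cmb)
    return math.expm1(x) ** 2 / (x * x * math.exp(x))

# Characteristic frequency nu_CMB = k_B T_CMB / h, in GHz.
nu_cmb_ghz = K_BOLTZ * T_CMB / H_PLANCK / 1e9
print(f"nu_CMB = {nu_cmb_ghz:.1f} GHz")

# Band centers from the text, treated as monochromatic (an assumption).
for nu in (30, 44, 70, 100, 143, 217, 353, 545, 857):
    print(f"{nu:4d} GHz  x = {x_of_nu(nu):6.3f}  factor = {rj_to_thermo_factor(nu):12.3f}")
```

The factor stays within a few percent of unity in the lowest band and grows by orders of magnitude toward 857 GHz, which is why the distinction between the two temperatures matters most for the high-frequency channels.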

It should be remembered that for adding optically thin emissions, it is the intensities, and 
not the thermodynamic temperatures, that add. Moreover, foreground emission power laws 
know nothing about $T_{\mathrm{CMB}}$ and thus are naturally expressed using either the specific intensity,
or equivalently the R-J temperature. 

Figure 10: CMB angular power spectrum as measured by Planck. The binned $C_\ell^{TT}$
power spectrum as given in the Planck 2013 release is plotted with error bars that combine
uncertainties from cosmic variance (dominant at low $\ell$) and instrumental noise (dominant at
high $\ell$). The solid curve indicates the theoretical predictions of the six-parameter concordance
model with the $1\sigma$ cosmic variance for the adopted binning scheme. The lower panel shows the
residuals. For more details, see Ref. cm. The exquisite fit, seen here up to about $\ell \approx 2200$,
into the damping tail, has been shown to extend to smaller angular scales by ACT [ISS] and
SPT [102]. (Credit: ESA/Planck Collaboration)

Other important milestones of CMB observation include the DASI discovery of the
polarization of the CMB in 2002 cma and the observations at small angular scales carried out
by the ACT and SPT teams, which at the time of writing constitute the best measurements
of the microwave anisotropies on large patches of the sky at high angular resolution,
measuring the power spectrum up to $\ell \approx 10{,}000$. The DASI experiment was carried out using
several feed horns pointed directly at the sky. Correlations were taken of the signals from
pairs of horns, thus measuring the sky signal interferometrically. The 6 m diameter Atacama
Cosmology Telescope (ACT) situated in the Atacama desert in Chile [ ITM 58] and the 10 m
SPT (South Pole Telescope) located at the Amundsen-Scott South Pole Station in Antarctica
jTTM HIM OSH probe the microwave sky on very small angular scales, beyond the angular
resolution of Planck. CMB observations are almost always diffraction limited, so that the
angular resolution is inversely proportional to the telescope diameter. Ground based
instruments, despite all their handicaps (i.e., atmospheric interference, lack of stability over long
timescales, ground pickup from far sidelobes) will always outperform space based experiments
in angular resolution. There are dozens of other suborbital experiments not mentioned here
but which paved the road for contemporary CMB observation. These experiments were
important not only because of their observations, but also because of their role in technology
development and in the development of new data analysis techniques. More information on
these experiments can be found in other earlier reviews.^c
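
The statement that diffraction-limited resolution scales inversely with telescope diameter can be illustrated numerically. The sketch below uses the Rayleigh criterion $\theta \approx 1.22\,\lambda/D$ as a representative diffraction scale; the prefactor, the function name, and the choice of a 150 GHz observing frequency are assumptions of this example and are not taken from the text. Only the 6 m and 10 m apertures come from the text.

```python
import math

C_LIGHT = 299792458.0  # m / s

def diffraction_scale_arcmin(diameter_m, nu_ghz, prefactor=1.22):
    """Rayleigh-criterion angle prefactor * lambda / D, in arcminutes.

    The prefactor depends on how the aperture is illuminated; 1.22 applies
    to a uniformly illuminated circular aperture (assumed here).
    """
    wavelength_m = C_LIGHT / (nu_ghz * 1e9)
    theta_rad = prefactor * wavelength_m / diameter_m
    return math.degrees(theta_rad) * 60.0

NU_GHZ = 150.0  # assumed observing frequency for this illustration
act = diffraction_scale_arcmin(6.0, NU_GHZ)    # ACT aperture from the text
spt = diffraction_scale_arcmin(10.0, NU_GHZ)   # SPT aperture from the text
print(f"ACT (6 m):  {act:.2f} arcmin")
print(f"SPT (10 m): {spt:.2f} arcmin")
```

Both scales come out near one arcminute, several times finer than the 5 arcmin quoted above for the best Planck channels.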


2 Brief Thermal History of the Universe 


The big bang model of the universe is an unfinished story. Certain aspects of the big bang
model are well established observationally, while other aspects are more provisional and
represent our present best-bet speculation. In the account below, we endeavor to distinguish what
is relatively certain and what is more speculative.

If we assume: (i) the correctness of general relativity, (ii) that the universe is
homogeneous and isotropic on large scales (at least up to the size of that part of the universe
presently observable to us), and (iii) that for calculating the behavior of the universe,
small-scale anisotropies can be averaged over, we obtain the Friedmann-Lemaître-Robertson-Walker
(FLRW) solutions to the Einstein field equations, which we now describe. The metric for this
family of solutions takes the following form:


$$ ds^2 = -dt^2 + a^2(t)\,\gamma_{ij}\,dx^i dx^j , \tag{4} $$

where $i,j = 1,2,3$ and the line element $d\ell^2 = \gamma_{ij}dx^i dx^j$ describes a maximally symmetric
three-dimensional space that may be Euclidean (flat), hyperbolic, or spherical.^d For the flat
case, which is the most important and most often discussed, Eq. (4) takes the form

$$ ds^2 = -dt^2 + a^2(t)\left[dx_1^2 + dx_2^2 + dx_3^2\right]. \tag{5} $$

^c See http://lambda.gsfc.nasa.gov/links/experimental_sites.cfm for an extensive compilation of CMB
experiments with links to their websites. The book [150] provides an insightful account emphasizing the early
history of CMB observations with contributions from many of the major participants.
^d More generally, the three-dimensional line element is

$$ ds^2 = \gamma_{ij}dx^i dx^j = \frac{dx_1^2 + dx_2^2 + dx_3^2}{\left(1 + \frac{k}{4}\left(x_1^2 + x_2^2 + x_3^2\right)\right)^2} $$

where $k = +1$, $0$, or $-1$ corresponds to spherical, flat (Euclidean), or hyperbolic three-dimensional geometry,
respectively. Apart from an overall change of scale, these are the only geometries satisfying the hypotheses of
spatial homogeneity and isotropy. By substituting $\tan(\chi/2) = r/2$ or $\tanh(\chi/2) = r/2$ for the cases $k = +1$ or


Here a(t) represents the scale factor of the universe. Under the assumption of homogeneity 
and isotropy, the stress-energy tensor (expressed as a mixed tensor with one contravariant and 
one covariant index) must take the form 


$$ T^{\mu}{}_{\nu} = \begin{pmatrix} \rho(t) & 0 & 0 & 0 \\ 0 & p(t) & 0 & 0 \\ 0 & 0 & p(t) & 0 \\ 0 & 0 & 0 & p(t) \end{pmatrix} . \tag{6} $$


The Einstein field equations $G_{\mu\nu} = R_{\mu\nu} - (1/2)g_{\mu\nu}R = (8\pi G)T_{\mu\nu}$ may be reduced to the
two equations

$$ H^2(t) = \frac{\dot{a}^2(t)}{a^2(t)} = \frac{8\pi G}{3}\,\rho(t) - \frac{k}{a^2(t)} \tag{7} $$

and

$$ \frac{\ddot{a}(t)}{a(t)} = -\frac{4\pi G}{3}\,(\rho + 3p). \tag{8} $$

(For more details, see for example the discussion in the books by Weinberg [220] or Wald
[223].)

The above discussion, which included only general relativity, is incomplete because no
details concerning the dynamics of $\rho(t)$ and $p(t)$ have been given. The only constraint imposed
by general relativity is stress-energy conservation, which in the most general case is expressed
as $T^{\mu\nu}{}_{;\nu} = 0$, but in the special case above takes the form

$$ \frac{d\rho(t)}{dt} = -3H(t)\left[\rho(t) + p(t)\right]. \tag{9} $$

The relationship between $p$ and $\rho$ is known as the equation of state, and for the special
case of a perfect fluid $p$ depends only on $\rho$ and on no other variables. Two special cases
are of particular interest: a nonrelativistic fluid, for which $p/\rho = 0$, and an ultrarelativistic
fluid, for which $p/\rho = 1/3$. Until the mid-90s it was commonly believed that except at the
very beginning of the universe, a two-component fluid consisting of a radiation component
and matter component suffices to account for the stress-energy filling the universe. In such a
model

$$ \rho(t) = \rho_{m,0}\left(\frac{a_0}{a(t)}\right)^3 + \rho_{r,0}\left(\frac{a_0}{a(t)}\right)^4 . \tag{10} $$


Here the subscript 0 refers to the present time. For the radiation component, there is obviously
the contribution from the CMB photons, whose discovery was recounted above in Sec. 1. If we
assume that the universe started as a plasma at some very large redshift with a certain baryon

$k = -1$, we may obtain the more familiar representations $ds^2 = d\chi^2 + \sin^2\chi\, d\Omega_{(2)}^2$ or $ds^2 = d\chi^2 + \sinh^2\chi\, d\Omega_{(2)}^2$,
respectively, where $d\Omega_{(2)}^2 = d\theta^2 + \sin^2\theta\, d\phi^2$.

number density, most conveniently parametrized as a baryon-to-photon or baryon-to-entropy
ratio, we would conclude that there was statistical equilibrium early on between the various
species, and from this assumption we can calculate the neutrino density today. It turns out
that the neutrinos are colder than the photons because the neutrinos fell out of statistical
equilibrium before the electrons and positrons annihilated. The $e^+e^-$ annihilation had the
effect of heating the photons relative to the neutrinos, because all the entropy that was in
the electrons and positrons was dumped into the photons. Consequently, assuming a minimal
neutrino sector, we can calculate the number of effective bosonic degrees of freedom due to
the cosmic neutrino background, which according to big bang theory must be present but has
never been detected. Measuring the number of baryons in the universe today is more difficult,
and is the subject of an extensive literature. The realization of a need for some additional
nonbaryonic “dark” matter dates back to Fritz Zwicky’s study of the Coma cluster in 1933
and is a long story that we do not have time to go into. Today “cold dark matter” is part
of the concordance model, where the name simply means that whatever the details of this
extra component may be (the lightest supersymmetric partner of the ordinary particles in
the Standard electroweak model, the axion, or yet something else), for analyzing structure
formation in the universe, we can treat this component as nonrelativistic (cold) particles that
can be idealized as interacting only through gravitation.
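
The entropy argument for the neutrino temperature given earlier in this paragraph can be turned into numbers. The sketch below is the standard textbook estimate and rests on assumptions not spelled out in the text: instantaneous neutrino decoupling, three massless neutrino species, and complete transfer of the $e^+e^-$ entropy to the photons. Variable names are invented for the example.

```python
T_CMB = 2.725  # K, photon temperature today (from the text)

# Effective bosonic degrees of freedom sharing the photon-bath entropy.
# Fermions count with a factor 7/8; e- and e+ contribute 2 spin states each.
g_before = 2.0 + (7.0 / 8.0) * 4.0   # photons + e+ e- (before annihilation)
g_after = 2.0                        # photons only (after annihilation)

# Entropy conservation in the photon bath, g * T^3 * a^3 = const, while the
# decoupled neutrinos simply cool as 1/a, gives T_nu / T_gamma = (4/11)^(1/3).
temp_ratio = (g_after / g_before) ** (1.0 / 3.0)
t_nu_today = temp_ratio * T_CMB

# Neutrino-to-photon energy density ratio, assuming three massless species
# with 2 states each (neutrino + antineutrino): (7/8) * (6/2) * ratio^4.
N_SPECIES = 3
rho_nu_over_rho_gamma = (7.0 / 8.0) * N_SPECIES * temp_ratio ** 4

print(f"T_nu / T_gamma  = {temp_ratio:.4f}")
print(f"T_nu today      = {t_nu_today:.3f} K")
print(f"rho_nu / rho_g  = {rho_nu_over_rho_gamma:.3f}")
```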

In the mid-1990s it was realized that a model where the stress-energy included only matter
and radiation components could not account for the observations. A crucial observation for
many researchers was the measurement of the apparent luminosities of Type Ia supernovae as a
function of redshift [158], [196], although at that time there were already several other discordant
observations indicating that a universe with only matter and radiation was not
viable.^e To reconcile the observations with theory, another component having a
large negative pressure, such as would arise from a cosmological constant, was required.

A cosmological constant makes a contribution to the stress-energy proportional to the
metric tensor $g_{\mu\nu}$, meaning that $p = -\rho$ exactly. This relation inserted into the right-hand
side of Eq. (9) implies that any expansion (or contraction) does not change the contribution
to the density from the cosmological constant.
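
The scaling of each component follows from Eq. (9) in one step. As a short worked derivation, assume a constant equation of state $p = w\rho$ (the symbol $w$ and the constancy assumption are introduced here for illustration):

```latex
% Assumption: constant equation of state p = w \rho, with H = \dot{a}/a.
\begin{align}
  \frac{d\rho}{dt} &= -3H(\rho + p) = -3(1+w)\,\frac{\dot{a}}{a}\,\rho
  \quad\Longrightarrow\quad
  \frac{d\ln\rho}{d\ln a} = -3(1+w), \\
  \rho(a) &= \rho_0 \left(\frac{a_0}{a}\right)^{3(1+w)} .
\end{align}
% Special cases:
%   w = 0    (nonrelativistic matter):  \rho \propto a^{-3}
%   w = 1/3  (radiation):               \rho \propto a^{-4}
%   w = -1   (cosmological constant):   \rho = \text{const}
```

The first two cases reproduce the matter and radiation terms of Eq. (10), and the third is the statement that expansion leaves the cosmological constant contribution unchanged.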

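With a cosmological constant, the density of the universe scales in the following manner:

$$ \rho(t) = \rho_{\Lambda,0} + \rho_{m,0}\left(\frac{a_0}{a(t)}\right)^3 + \rho_{r,0}\left(\frac{a_0}{a(t)}\right)^4 . \tag{11} $$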


The cosmological constant is most important at late times, when its density rapidly comes
to dominate over the contribution from nonrelativistic matter (consisting according to our
best current understanding of baryons and a so-called cold dark matter component). It is
customary to express the contributions of each component as a fraction of the critical density
$\rho_{\mathrm{crit}} = 3H^2/8\pi G$, which would equal the total density if the universe were exactly spatially
flat. For the $i$th component, we define $\Omega_i = (\rho_i/\rho_{\mathrm{crit}})$.
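
As a numerical illustration of the critical density, the following sketch evaluates $\rho_{\mathrm{crit}} = 3H^2/8\pi G$ in SI units. The Hubble constant of 70 km s$^{-1}$ Mpc$^{-1}$ is a round illustrative value chosen for this example, not a number from the text, and the function names are hypothetical.

```python
import math

G_NEWTON = 6.67430e-11           # m^3 kg^-1 s^-2
MPC_IN_M = 3.0856775814913673e22 # meters per megaparsec

def critical_density(h0_km_s_mpc):
    """rho_crit = 3 H^2 / (8 pi G) in kg / m^3 for H given in km/s/Mpc."""
    h0_si = h0_km_s_mpc * 1e3 / MPC_IN_M   # convert to s^-1
    return 3.0 * h0_si ** 2 / (8.0 * math.pi * G_NEWTON)

def omega(rho_i, h0_km_s_mpc):
    """Density parameter Omega_i = rho_i / rho_crit."""
    return rho_i / critical_density(h0_km_s_mpc)

H0_ASSUMED = 70.0  # km/s/Mpc, illustrative value only
rho_crit = critical_density(H0_ASSUMED)
print(f"rho_crit = {rho_crit:.3e} kg/m^3")
```

Because $\rho_{\mathrm{crit}} \propto H^2$, any quoted $\Omega_i$ is tied to the value of the Hubble constant used to define it.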

From the present data, we do not know whether the new component (or components) with
a large negative pressure is a cosmological constant, although current observations indicate
that $w = p/\rho$ for this component must lie somewhere near $w = -1$. A key question of many
of the major initiatives of contemporary observational cosmology is to measure $w$ (and its
evolution in time) in order to detect a deviation of the behavior of the dark energy from that
predicted by a cosmological constant.

^e See for example some of the contributions in Ref. [222] and references therein.

The earliest probe of the hot big bang model arises from comparing the primordial light
element abundances as inferred from observations to the theory of primordial nucleosynthesis.
In a hot big bang scenario, there is only one free parameter: the baryon-to-photon ratio $\eta_B$,
or equivalently, the baryon-to-entropy ratio $\eta_S$, which is related to the former by a constant
factor. As discussed above, in the hot big bang theory at early times, in the limit as $a(t) \to 0$,
we have $\rho \to \infty$ and $T \to \infty$. For calculating the primordial light element abundances, we
do not need to extrapolate all the way back to the initial singularity, where these quantities
diverge. Rather it suffices to begin the calculation at a temperature sufficiently low so that all
the baryon number of the universe is concentrated in nucleons (rather than in free quarks), but
high enough so that there are only protons and neutrons rather than bound nuclei consisting
of several nucleons, and so that the reaction

$$ p + e^- \leftrightarrow n + \nu_e , \tag{12} $$

(as well as related reactions) proceeds at a rate faster than the expansion rate of the universe 
H. Under this assumption, the ratio of protons to neutrons is determined by an equilibrium 
condition expressed in terms of chemical potentials 

$$ \mu(p) + \mu(e^-) = \mu(n) + \mu(\nu_e) + \Delta M , \tag{13} $$


where 


$$ \Delta M = M_N - M_P - M_e = 939.56\ \mathrm{MeV} - 938.27\ \mathrm{MeV} - 511\ \mathrm{keV} = 0.78\ \mathrm{MeV}. \tag{14} $$


At first, when T (AM), the protons and neutrons have almost precisely equal abundances, 
but then, as the universe cools down and expands, the neutrons become less abundant than the 
protons owing to the neutron’s slightly greater mass. Then the reactions (12) freeze out—that 
is, their rate becomes small compared to H —and the neutron-proton ratio becomes frozen 
in. Subsequently protons and neutrons fuse to form deuterium, almost all of which combines
to form $^4$He, either directly from a pair of deuterons or through first forming $^3$He, which
subsequently fuses with a free neutron. Almost all the neutrons that do not decay end up
in the form of $^4$He, because of its large binding energy relative to other low-$A$ nuclei, but trace
amounts of other light elements are also produced. Even though other heavier nuclei have 
a larger binding energy per nucleon and their production would be favored by equilibrium 
considerations, the rates for their production are too small, principally because of the large 
Coulomb barrier that must be overcome. 
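
As a rough numerical illustration of Eqs. (13) and (14), the following Python sketch evaluates $\Delta M$ from the quoted masses and an equilibrium neutron-to-proton ratio. It assumes the standard Boltzmann factor $\exp[-(M_N - M_P)/T]$ with negligible lepton chemical potentials, which is a simplification not spelled out in the text. The function names and sample temperatures are illustrative choices.

```python
import math

# Particle masses in MeV, as quoted in Eq. (14).
M_NEUTRON = 939.56
M_PROTON = 938.27
M_ELECTRON = 0.511


def delta_m() -> float:
    """Mass difference M_N - M_P - M_e of Eq. (14), in MeV."""
    return M_NEUTRON - M_PROTON - M_ELECTRON


def equilibrium_n_over_p(temperature_mev: float) -> float:
    """Equilibrium neutron-to-proton ratio at temperature T (in MeV).

    Assumption: nondegenerate nucleons and negligible lepton chemical
    potentials, so that n/p = exp[-(M_N - M_P) / T].
    """
    return math.exp(-(M_NEUTRON - M_PROTON) / temperature_mev)


if __name__ == "__main__":
    print(f"Delta M = {delta_m():.2f} MeV")
    for t in (100.0, 10.0, 1.0, 0.5):
        print(f"T = {t:6.1f} MeV   n/p = {equilibrium_n_over_p(t):.3f}")
```

At temperatures far above the mass difference the ratio is close to unity, and it falls as the universe cools, in line with the qualitative description above. The freeze-out temperature itself is not computed here.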

In primordial big bang nucleosynthesis (BBN) the only free parameter is $\eta_b$. Since
nucleosynthesis takes place far into the epoch of radiation domination, the density as a function
of temperature is determined by the equation $\rho(T) = (\pi^2/30) N_{\rm rel} T^4$, where $N_{\rm rel}$ is the
effective number of bosonic degrees of freedom. The expansion rate then is important because
it determines for example how many neutrons are able to be integrated into nuclei heavier
than hydrogen before decaying. Based on the particles known to us that are ultrarelativistic
during nucleosynthesis, the photon, electron, positron, neutrinos, and antineutrinos, we think
that we know what value $N_{\rm rel}$ should have. But nucleosynthesis can be used to test for the
presence of extra relativistic degrees of freedom. We shall see later that the CMB can also be
used to constrain $N_{\rm rel}$. For a review with more details about primordial nucleosynthesis, see
Ref. [228].
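
The link between the number of relativistic degrees of freedom and the expansion rate can be made concrete with a short Python sketch. It evaluates $\rho(T) = (\pi^2/30) N_{\rm rel} T^4$ and the Hubble rate $H = \sqrt{8\pi G \rho/3}$ of a flat, radiation-dominated universe in natural units. The counting $N_{\rm rel} = 2 + (7/8)(4 + 6)$ for photons, electrons, positrons, and three neutrino species, the treatment of electrons as fully relativistic, and the numerical constants are assumptions of this sketch rather than values taken from the text.

```python
import math

# Natural units (hbar = c = k_B = 1); energies in MeV.
PLANCK_MASS_MEV = 1.2209e22   # Planck mass, with G = 1 / M_Pl^2
HBAR_MEV_S = 6.582e-22        # hbar in MeV s, used to convert a rate to 1/s


def n_rel_standard() -> float:
    """Effective bosonic degrees of freedom for photons, e+/e-, and neutrinos.

    Fermions are weighted by 7/8: photons (2), e- and e+ (4),
    three neutrino and antineutrino species (6).
    """
    return 2.0 + (7.0 / 8.0) * (4.0 + 6.0)


def energy_density(temperature_mev: float, n_rel: float) -> float:
    """rho(T) = (pi^2 / 30) N_rel T^4, in MeV^4."""
    return (math.pi ** 2 / 30.0) * n_rel * temperature_mev ** 4


def hubble_rate_per_s(temperature_mev: float, n_rel: float) -> float:
    """H = sqrt(8 pi G rho / 3) for a flat radiation-dominated universe, in 1/s."""
    rho = energy_density(temperature_mev, n_rel)
    h_mev = math.sqrt(8.0 * math.pi * rho / 3.0) / PLANCK_MASS_MEV
    return h_mev / HBAR_MEV_S


if __name__ == "__main__":
    n_rel = n_rel_standard()
    for extra in (0.0, 1.75):  # 1.75 = one extra neutrino-like species (7/8 * 2)
        h = hubble_rate_per_s(1.0, n_rel + extra)
        print(f"N_rel = {n_rel + extra:5.2f}   H(T = 1 MeV) = {h:.3f} / s")
```

Because $H \propto \sqrt{N_{\rm rel}}\, T^2$, any additional relativistic species speeds up the expansion at fixed temperature, which is why the light element abundances are sensitive to $N_{\rm rel}$.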

3 Cosmological Perturbation Theory: Describing a Nearly
Perfect Universe Using General Relativity 


The broad brush account of the history of the universe presented in the previous section 
tells an average story, where spatial homogeneity and isotropy have been assumed. As we 
shall see, early on, at large z, this story is not far from the truth, especially on large scales. 
But a universe that is exactly homogeneous and isotropic would be quite unlike our own, in 
that there would be no clustering of matter, observed very concretely in the form of galaxies, 
clusters of galaxies, and so forth. In this section, we present a brief sketch of the theory of 
cosmological perturbations. For excellent reviews with a thorough discussion of the theory of 
cosmological perturbations in the framework of general relativity, see Refs. [103] and [142].

For the “scalar” perturbations^f we may write the line element for the metric with its
linearized perturbations $\Phi(\mathbf{x}, \eta)$ and $\Psi(\mathbf{x}, \eta)$ in the following form:

$ds^2 = a^2(\eta)\left[-(1 + 2\Phi)\,d\eta^2 + (1 - 2\Psi)\,d\mathbf{x}^2\right]$. (15)


The functions $\Phi(\mathbf{x}, \eta)$ and $\Psi(\mathbf{x}, \eta)$ are the Newtonian gravitational potentials. At low velocities
$\Phi(\mathbf{x}, \eta)$, which perturbs the time-time component of the metric, is most relevant, but $\Psi(\mathbf{x}, \eta)$ can be probed observationally using light or some other
type of ultrarelativistic particles. When there are no anisotropic stresses (i.e., all the partial
pressures of the stress-energy tensor are equal), $\Phi = \Psi$.

For a universe filled with a perfect fluid with an equation of state $p = p(\rho)$, the evolution
of the Newtonian gravitational potential $\Phi(\mathbf{x}, \eta)$ is governed by the equation (see Ref. [142]
for a derivation and more details)

$\Phi'' + 3\mathcal{H}(1 + c_s^2)\Phi' - c_s^2 \nabla^2 \Phi + \left[2\mathcal{H}' + (1 + 3c_s^2)\mathcal{H}^2\right]\Phi = 0$, (16)

where $c_s^2 = dp/d\rho$, $\mathcal{H} = a'/a$, and $' = d/d\eta$. Here $\eta$ is the conformal time, related to the more
physical proper time by the relation $dt = a(\eta)\,d\eta$.

If the spatial derivative term (i.e., $\nabla^2 \Phi$) is neglected, which is an approximation valid on
superhorizon scales, then the derived quantity [IUJ 1128]

$\zeta = \Phi + \frac{2}{3}\,\frac{\Phi + \mathcal{H}^{-1}\Phi'}{1 + w}$ (17)


(where $w = p/\rho$) is conserved on superhorizon scales. This approximate invariant, which can
be related to the spatial curvature of the surfaces of constant density, is invaluable for relating 
perturbations at the end of inflation (or some other very early noninflationary epoch) to the 
later times of interest in this chapter. This property is particularly useful given our ignorance 
of the intervening epoch, the period of “reheating” or “entropy generation,” the details of 
which are unknown and highly model dependent. 
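
The conservation of $\zeta$ can be checked numerically in a simple case. The Python sketch below integrates Eq. (16) with the gradient term dropped through the transition from radiation to matter domination and monitors the combination of Eq. (17). Several ingredients are assumptions of this sketch and not taken from the text: the background is a flat universe of matter plus radiation with adiabatic perturbations, for which $w = 1/[3(1+y)]$ and $c_s^2 = 4/[3(4+3y)]$ with $y = a/a_{\rm eq}$; the equation is rewritten in terms of $N = \ln y$ using $\mathcal{H}'/\mathcal{H}^2 = -(1+3w)/2$, giving $\Phi_{NN} + \frac{1}{2}(5 + 6c_s^2 - 3w)\Phi_N + 3(c_s^2 - w)\Phi = 0$; and a hand-written fourth-order Runge-Kutta stepper is used.

```python
import math
from typing import Tuple


def background(log_y: float) -> Tuple[float, float]:
    """Return (w, c_s^2) for matter plus radiation, with y = a / a_eq (assumed model)."""
    y = math.exp(log_y)
    w = 1.0 / (3.0 * (1.0 + y))
    cs2 = 4.0 / (3.0 * (4.0 + 3.0 * y))
    return w, cs2


def rhs(log_y: float, phi: float, dphi: float) -> Tuple[float, float]:
    """Superhorizon limit of Eq. (16), written with N = ln(y) as time variable."""
    w, cs2 = background(log_y)
    ddphi = -0.5 * (5.0 + 6.0 * cs2 - 3.0 * w) * dphi - 3.0 * (cs2 - w) * phi
    return dphi, ddphi


def zeta(log_y: float, phi: float, dphi: float) -> float:
    """Eq. (17), using Phi' / H = dPhi/dN."""
    w, _ = background(log_y)
    return phi + (2.0 / 3.0) * (phi + dphi) / (1.0 + w)


def evolve(start: float = -9.0, end: float = 9.0, steps: int = 4000) -> Tuple[float, float, float]:
    """Integrate from deep radiation to deep matter domination with RK4.

    Returns (final Phi, initial zeta, final zeta) for Phi = 1, dPhi/dN = 0 initially.
    """
    h = (end - start) / steps
    n, phi, dphi = start, 1.0, 0.0
    zeta_start = zeta(n, phi, dphi)
    for _ in range(steps):
        k1 = rhs(n, phi, dphi)
        k2 = rhs(n + 0.5 * h, phi + 0.5 * h * k1[0], dphi + 0.5 * h * k1[1])
        k3 = rhs(n + 0.5 * h, phi + 0.5 * h * k2[0], dphi + 0.5 * h * k2[1])
        k4 = rhs(n + h, phi + h * k3[0], dphi + h * k3[1])
        phi += h / 6.0 * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0])
        dphi += h / 6.0 * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1])
        n += h
    return phi, zeta_start, zeta(n, phi, dphi)


if __name__ == "__main__":
    phi_end, zeta_start, zeta_end = evolve()
    print(f"Phi_final / Phi_initial = {phi_end:.4f}")
    print(f"zeta_initial = {zeta_start:.5f}, zeta_final = {zeta_end:.5f}")
```

In this toy background the potential itself changes across the transition while $\zeta$ stays fixed, which is the property that makes $\zeta$ convenient for bridging epochs whose details are poorly known.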

To understand the qualitative behavior of the above equation, it is important to focus 
on the role of the Hubble parameter $H(t) = \dot{a}(t)/a(t)$, which at time $t$ defines the dividing

f In the theory of cosmological perturbations, the terms “scalar,” “vector,” and “tensor” are given a special 
meaning. A tensor of any rank that can be constructed using derivatives acting on a scalar function and the 
Kronecker delta is regarded as a “scalar”. A quantity that is not a “scalar” but can be constructed in the 
same way from a 3-vector field is regarded as “vector,” and a “tensor” is a second-rank tensor with its “scalar” 
and “vector” components removed. Under these definitions, the linearized perturbation equations reduce to a 
block diagonal form in which the “scalar,” “vector,” and “tensor” sectors do not talk to each other. 
line between superhorizon modes [for which $|\mathbf{k}_{\rm phys}| \ll H(t)$] and subhorizon modes [for which
$|\mathbf{k}_{\rm phys}| \gg H(t)$].^g This distinction relates to the relevance of the spatial gradient terms in
the evolution equations for the cosmological perturbations. This same distinction may be
reexpressed in terms of the conformal time $\eta$ and the co-moving wavenumber $\mathbf{k}$, where $\mathcal{H} = a'/a$
takes the place of $H = \dot{a}/a$, with superhorizon meaning $|\mathbf{k}| \ll \mathcal{H}(\eta)$ and subhorizon meaning
$|\mathbf{k}| \gg \mathcal{H}(\eta)$. Here dots denote derivatives with respect to proper time whereas the primes
denote derivatives with respect to conformal time. It turns out that for calculations, working
with conformal time and co-moving wavenumbers is much more convenient than working with
physical variables.

What happens to the perturbations as the universe expands depends crucially on whether 
the co-moving horizon size is expanding or contracting. During inflation, or more generally 
when $w < -1/3$, the co-moving horizon size is contracting. This means that the dynamics
of Fourier modes, which initially hardly felt the expansion of the universe, become more and
more affected by the expansion of the universe. Terms proportional to $\mathcal{H}$ and $\mathcal{H}^2$ in the
evolution equations are becoming increasingly relevant whereas the spatial gradient terms are 
becoming increasingly irrelevant as the mode “exits” the horizon, finally to become completely 
“frozen in” on superhorizon scales. During inflation a mode starts far within the horizon and 
crosses the horizon at some moment during inflation. At the end of inflation, all the modes 
of interest lie frozen in, far outside the horizon. 
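
The role of the threshold $w = -1/3$ follows from a one-line scaling argument, sketched here in Python. For a flat universe with constant $w$, the Friedmann equation gives $H^2 \propto \rho \propto a^{-3(1+w)}$, so the co-moving Hubble radius scales as $1/(aH) \propto a^{(1+3w)/2}$. The restriction to constant $w$ and the function names are assumptions made for illustration.

```python
def comoving_horizon_exponent(w: float) -> float:
    """Exponent n in 1/(aH) ∝ a^n for a flat universe with constant w.

    Follows from H^2 ∝ rho ∝ a^(-3(1+w)).
    """
    return 0.5 * (1.0 + 3.0 * w)


def horizon_is_shrinking(w: float) -> bool:
    """True when the co-moving horizon contracts, so that modes exit the horizon."""
    return comoving_horizon_exponent(w) < 0.0


if __name__ == "__main__":
    for label, w in (("near de Sitter", -0.99), ("matter", 0.0), ("radiation", 1.0 / 3.0)):
        n = comoving_horizon_exponent(w)
        print(f"{label:15s} w = {w:+.2f}   1/(aH) ∝ a^{n:+.3f}   shrinking: {horizon_is_shrinking(w)}")
```

The exponent changes sign exactly at $w = -1/3$: modes exit the horizon while it is negative and re-enter once it is positive.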

During inflation, $w = p/\rho$ was slightly more positive than $-1$. Formally inflation ends when
$w$ crosses the milestone $w = -1/3$, which is when the co-moving horizon stops shrinking and
starts to expand. This moment roughly corresponds to the onset of “reheating” or “entropy
generation,” when the vacuum energy of the inflaton field gets converted into radiation, or
equivalently ultrarelativistic particles, which afterward presumably constitute the dominant
contribution to the stress-energy $T_{\mu\nu}$, so that $w \approx 1/3$. During the radiation epoch following
“entropy generation,” modes successively re-enter the horizon again becoming dynamical. The 
degrees of freedom describing the modes are not the same as previously during inflation. This 
is at present our best-bet story of what likely happened in the very early universe. In this 
chapter, we shall be interested in the later history of the universe, when modes are re-entering 
the horizon. 

We shall not enter into the details of how the primordial perturbations are generated 
from quantum fluctuations of the vacuum during inflation, instead referring the reader to the 
chapter by K. Sato and J. Yokoyama on Cosmic Inflation for this early part of the story. Nor 
shall we discuss alternatives to inflation. The original papers in which the scalar perturbations 
generated from inflation were first calculated include Mukhanov 11 f(). Il4ll 139 J, Hawking [83],
Guth [76], Starobinsky [218], and Bardeen et al. [11]. For early discussion of the generation of
gravity waves during inflation and their subsequent imprint on the CMB, see Refs. [HU 11971.
HSUg, and m- A more pedagogical account may be found for example in the books by
Liddle and Lyth [125] and by Peacock [149], as well as in several review articles, including for
example [129].

We emphasize the simplest type of cosmological perturbations, known as primordial
“adiabatic” perturbations. The fact that the perturbations are primordial means that at the late
times of interest to us, only the growing adiabatic mode is present, because whatever the 
amplitude of the decaying mode may initially have been, sufficient time has passed for its 

g Here “horizon” means “apparent horizon,” which depends only on the instantaneous expansion rate, unlike 
the “causal horizon,” which depends on the entire previous expansion history. 
amplitude to decay away and thus become irrelevant. The term “adiabatic” deserves some
explanation because its customary meaning in thermodynamics does not precisely correspond
to how it is used in the context of cosmological perturbations. In a universe with only
adiabatic perturbations, at early times all the components initially shared a common equation
of state and common velocity field, and moreover on surfaces of constant total density, the
partial densities of the components are also spatially constant. This is the state of affairs
initially, on superhorizon scales, but subsequently after the modes enter the horizon, the different
components can and do separate.

Adiabatic modes are the simplest possibility for the primordial cosmological perturbations, 
but they are not the only possibility. Isocurvature perturbations where the equation of state 
varies spatially are also possible, as discussed in detail for example in Refs. [22], [2S] and pB] 
references therein. For isocurvature perturbations, the ratios of components vary on a
constant density surface, on which the spatial curvature is constant as well. As the universe
expands, these variations in
the equation of state eventually also generate curvature perturbations. It might be argued
that the isocurvature modes should generically be expected to have been excited with some
nonvanishing amplitude in multi-field models of inflation. For recent constraints on
isocurvature perturbations from the CMB, see Planck Inflation 2013 [175] and the extensive list of
references therein. 


4 Characterizing the primordial power spectrum 

In the previous section, we defined the dimensionless invariant $\zeta(\mathbf{x})$, conserved on superhorizon
scales, which can be used to characterize the primordial cosmological perturbations at some
convenient moment in the early universe when all the relevant scales of interest lie far outside
the horizon. Because this quantity is conserved on superhorizon scales, the precise moment
chosen is arbitrary within a certain range. The existence of a quantity conserved on
superhorizon scales allows us to make precise predictions despite our nearly complete ignorance of
the details of reheating.

Assuming a spatially flat universe, we may expand $\zeta(\mathbf{x})$ into Fourier modes, so that

$\zeta(\mathbf{x}) = \int \frac{d^3k}{(2\pi)^3}\, \zeta(\mathbf{k}) \exp[i\mathbf{k}\cdot\mathbf{x}]$. (18)

If spatial homogeneity and isotropy are assumed, the correlation function of the Fourier
coefficients must take the form

$\langle \zeta(\mathbf{k})\, \zeta(\mathbf{k}') \rangle = (2\pi)^3 k^{-3}\, \mathcal{P}(k)\, \delta^3(\mathbf{k} - \mathbf{k}')$, (19)

which also serves as the definition of the function $\mathcal{P}(k)$ known as the primordial power
spectrum. The $k^{-3}$ factor is present so that $\mathcal{P}(k)$ is likewise dimensionless. An exactly scale
invariant power spectrum corresponds to $\mathcal{P}(k) \sim k^0$. The meaning of scale invariance can be
understood by writing the mean square fluctuation at a point as the following integral over
wavenumber:

$\langle \zeta^2(\mathbf{x}) \rangle \propto \int_{-\infty}^{+\infty} d[\ln(k)]\, \mathcal{P}(k)$. (20)

Changes of scale correspond to rigid translations in $\ln(k)$, and the only choice for $\mathcal{P}(k)$ that
does not single out a particular scale is $\mathcal{P}(k) = (\mathrm{constant})$.
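
The statement about scale invariance can be illustrated with a small Python sketch that integrates a power spectrum over logarithmic bands of wavenumber, in the spirit of Eq. (20). The power-law form $\mathcal{P}(k) = A\,(k/k_0)^{n_s - 1}$, the names `n_s` and `k_pivot`, and the omission of the overall normalization constant are assumptions introduced here for illustration.

```python
import math


def power_spectrum(k: float, amplitude: float = 1.0, n_s: float = 1.0, k_pivot: float = 1.0) -> float:
    """Dimensionless power-law spectrum P(k) = A (k / k_pivot)^(n_s - 1) (assumed form)."""
    return amplitude * (k / k_pivot) ** (n_s - 1.0)


def variance_in_band(k_min: float, k_max: float, n_s: float = 1.0, samples: int = 1000) -> float:
    """Midpoint-rule integral of P(k) over d ln(k) between k_min and k_max."""
    ln_min, ln_max = math.log(k_min), math.log(k_max)
    step = (ln_max - ln_min) / samples
    total = 0.0
    for i in range(samples):
        ln_k = ln_min + (i + 0.5) * step
        total += power_spectrum(math.exp(ln_k), n_s=n_s) * step
    return total


if __name__ == "__main__":
    for n_s in (1.0, 0.96):
        large_scales = variance_in_band(1e-3, 1e-2, n_s=n_s)
        small_scales = variance_in_band(1.0, 10.0, n_s=n_s)
        print(f"n_s = {n_s:.2f}: decade at low k = {large_scales:.4f}, decade at high k = {small_scales:.4f}")
```

For $n_s = 1$ every decade in $k$ contributes equally to the variance, whereas any other exponent favors one end of the range and so singles out a scale.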

5 Recombination, the Blackbody Spectrum, and Spectral
Distortions

It is sometimes said that the CMB is the best blackbody in the universe. This is not true. 
Observationally the only way to test how close the CMB is to a perfect blackbody is
to construct a possibly better artificial blackbody and perform a differential measurement 
between the emission from the sky and this artificial blackbody as a function of frequency. 

In our discussion of the CMB, we stressed that theory predicted a frequency spectrum 
having a blackbody form to a very high accuracy. This indeed was one of the striking
predictions of the big bang theory, and the lack of any measurable deviation from the perfect
blackbody spectral shape over a broad range of frequencies was probably one of the most
important observational facts that led to the demise of the alternative steady state
cosmological model. To date, the best measurements of the frequency spectrum of the CMB are still
those made by the COBE FIRAS instrument [133], [135], [53]. FIRAS [135] made differential
measurements comparing the CMB frequency spectrum on the sky to an artificial blackbody. 
Despite the many years that have gone by since COBE and the importance of measuring 
the absolute spectrum of the CMB, no better measurement has been carried out because 
improving on FIRAS would require going to space in order to avoid the spectral imprint of 
the Earth’s atmosphere. However a space mission concept called PIXIE has been proposed 
that would essentially redo FIRAS but with over two orders of magnitude better sensitivity 
and with polarization sensitivity included [105]. While differences in temperature between
different directions in the sky, particularly on small angular scales, can be measured from the 
ground, albeit with great difficulty, the same is not possible for measurements of the absolute 
spectrum. 

From a theoretical point of view, even in the simplest minimal cosmological model with no 
new physics, deviations from a perfect blackbody spectrum are predicted that could be
measured with improved observations of the absolute spectrum having a sensitivity significantly
beyond that of FIRAS. It is often believed that when we look at the CMB, we are probing
conditions in the universe at around last scattering—that is, around $z \approx 1100$—and that the
CMB photons are unaffected by what happened during significantly earlier epochs. This is
not true. Although when $z \gtrsim 1100$, photons are frequently scattered by electrons randomizing
their direction, causing them to move diffusively, this frequent scattering is inefficient at
equilibrating the kinetic temperature of the electrons to the energy spectrum of the photons and
vice versa owing to the smallness of the dimensionless parameter $E_\gamma/m_e$, where $E_\gamma$ is a typical
photon energy. This ratio gives the order of magnitude of the fractional energy exchange due 
to electron recoil for a typical collision. Given that in this limit the approach to a blackbody 
spectrum is a diffusive process, we would estimate that $(E_\gamma/m_e)^{-2}$ collisions correspond to one
decay time of the deviation of the photon energy spectrum from the equilibrium spectrum.
Plugging in $E_\gamma \sim k_B T_{z=1100} \sim 0.3$ eV, we get $(E_\gamma/m_e)^{-2} \sim 3 \times 10^{12}$! Thomson scattering,
however, cannot equilibrate the photon number density with the available energy density, so 
with Thomson scattering alone, an initially out of equilibrium photon energy distribution, 
for example as the result of some sort of energy injection, would generically settle down to 
a photon phase space distribution having a positive chemical potential. Processes such as 
Bremsstrahlung and its inverse or double Compton scattering are required to make the
chemical potential decay to zero, and these processes freeze out at $z \sim 10^6$ [95]. Therefore any
energy injection between $z \sim 10^6$ and $z \sim 10^3$ will leave its mark on the absolute spectrum
of the CMB photons, either in the form of a so-called $\mu$ distortion, or for energy injected at
later times, in the form of a more complicated energy dependence.
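
The order of magnitude of the diffusive estimate can be checked directly. The Python sketch below evaluates a typical photon energy $k_B T$ at a given redshift, using $T = T_0 (1+z)$, and the corresponding number of scatterings $(m_e/E_\gamma)^2$ per relaxation time. The present-day temperature of 2.725 K and the values of the physical constants are assumptions supplied here, and the function names are illustrative.

```python
K_B_EV_PER_K = 8.617333e-5     # Boltzmann constant in eV / K
ELECTRON_MASS_EV = 510998.95   # electron rest energy in eV
T_CMB_TODAY_K = 2.725          # assumed present-day CMB temperature in K


def photon_energy_ev(z: float) -> float:
    """Typical photon energy k_B T at redshift z, using T = T_0 (1 + z)."""
    return K_B_EV_PER_K * T_CMB_TODAY_K * (1.0 + z)


def collisions_per_relaxation(e_gamma_ev: float) -> float:
    """Diffusive estimate (m_e / E_gamma)^2 of the scatterings per decay time."""
    return (ELECTRON_MASS_EV / e_gamma_ev) ** 2


if __name__ == "__main__":
    e_ls = photon_energy_ev(1100.0)
    print(f"k_B T at z = 1100: {e_ls:.3f} eV")
    print(f"(m_e / E)^2 for E = 0.3 eV: {collisions_per_relaxation(0.3):.2e}")
    print(f"(m_e / E)^2 for E = k_B T: {collisions_per_relaxation(e_ls):.2e}")
```

The very large number of scatterings needed explains why Thomson scattering alone cannot restore a blackbody spectrum on the available time scale.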

A broad range of interesting early universe science can be explored by searching for
deviations from a perfect blackbody spectrum. Some of the spectral distortions are nearly certain
to be present, while others are of a more speculative nature. On the one hand, $y$-distortions
in the field (that is, away from galaxy clusters where the $y$-distortion is compact and
particularly large) constitute a nearly certain signal, as do the spectral lines from cosmological
recombination. But sources like decaying dark matter [31] may or may not be present.
Another interesting source of energy injection arises from Silk damping [207], [89] of the primordial
power spectrum on small scales, beyond the range of scales where the primordial power
spectrum can be probed by other means. If a space-based instrument is deployed with the requisite
sensitivity, a formidable challenge will be to distinguish these signals from each other, and
also from the more mundane galactic and extragalactic backgrounds. From this discussion
one might conclude, borrowing terminology from stellar astrophysics, that the CMB
“photosphere” extends back to around $z \sim 10^6$.

6 Sachs-Wolfe Formula and More Exact Anisotropy Calculations

At early times ($z \gtrsim 1100$) the universe is almost completely ionized and the photons scatter
frequently with free charged particles, primarily with electrons because the proton-photon scattering
cross-section is suppressed by a factor of $(m_e/m_p)^2$. During this period the plasma consisting of
baryons, leptons, and photons may be regarded as a single tightly-coupled fluid component, almost
inviscid but with a viscosity relevant on the shortest scales of interest for the calculation of the CMB
temperature and polarization anisotropies. Later, as the universe cools, the electrons and protons
(and other nuclei) combine to form neutral atoms, which are far smaller than the thermal wavelength
of the photons, rendering the universe almost transparent. During this process, known
as “recombination” (despite the fact that the universe had never previously been neutral), the
photons cease to rescatter and free stream toward us today [152]. This last statement is only
90% accurate, because the late-time reionization starting at about $z \sim 6$-$7$ causes about 10%
of the CMB photons to be rescattered. The effect of this rescattering on the CMB anisotropies
will be discussed in detail in Sec. 7.5. It is during this transitional epoch, situated around
$z \approx 1100$, that the CMB temperature and polarization anisotropies were imprinted. Thus
when we observe the microwave anisotropies on the sky today, we are probing the physical
conditions on a sphere approximately 47 billion light years in radius today,^h or more precisely
the intersection of our past light cone with the surface of constant cosmic time at $z \approx 1100$. This
sphere is known as the surface of last scatter or the last scattering surface.

Figure 11: Ionization fraction as function of redshift with effect of late-time ionizing radiation
ignored. Calculated using the code CosmoRec by Jens Chluba (for details see Ref. [213]).
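
The present-day radius of the last scattering sphere can be estimated by integrating the co-moving distance $D = (c/H_0)\int_0^z dz'/E(z')$ for a flat universe. The Python sketch below does this with Simpson's rule in the variable $\ln(1+z)$. The cosmological parameters ($H_0$, $\Omega_m$, $\Omega_r$) are illustrative values close to the Planck 2013 best fit, supplied here as assumptions, and the result should be read as an order-of-magnitude check on the radius quoted in the text, not as a precise determination.

```python
import math

C_KM_S = 299792.458      # speed of light in km/s
MPC_IN_GLY = 3.2616e-3   # one megaparsec in billions of light years


def hubble_function(z: float, omega_m: float, omega_r: float) -> float:
    """E(z) = H(z) / H_0 for a flat universe with matter, radiation, and a cosmological constant."""
    omega_lambda = 1.0 - omega_m - omega_r
    return math.sqrt(omega_r * (1.0 + z) ** 4 + omega_m * (1.0 + z) ** 3 + omega_lambda)


def comoving_distance_gly(
    z_max: float,
    h0_km_s_mpc: float = 67.3,   # assumed Hubble constant
    omega_m: float = 0.315,      # assumed matter density parameter
    omega_r: float = 9.2e-5,     # assumed radiation density parameter
    intervals: int = 2000,       # must be even for Simpson's rule
) -> float:
    """Co-moving distance to redshift z_max in billions of light years."""
    x_max = math.log(1.0 + z_max)
    step = x_max / intervals
    total = 0.0
    for i in range(intervals + 1):
        one_plus_z = math.exp(i * step)
        # dz = (1 + z) dx, so the integrand in x is (1 + z) / E(z).
        integrand = one_plus_z / hubble_function(one_plus_z - 1.0, omega_m, omega_r)
        if i == 0 or i == intervals:
            weight = 1.0
        elif i % 2 == 1:
            weight = 4.0
        else:
            weight = 2.0
        total += weight * integrand
    integral = total * step / 3.0
    return (C_KM_S / h0_km_s_mpc) * integral * MPC_IN_GLY


if __name__ == "__main__":
    print(f"Co-moving distance to z = 1100: {comoving_distance_gly(1100.0):.1f} billion light years")
```

The distance comes out several times larger than the age of the universe converted into light years, because the space traversed by the photons has continued to expand since they passed through it.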

The transition from a universe that is completely opaque to the blackbody photons, in 
which the tight-coupling approximation holds for the baryon-lepton plasma because the
photon mean free path is negligible, to a universe that is virtually transparent is not instantaneous.
This fact implies that the last scattering surface (LSS) is not infinitely thin, rather having a
finite width that must be taken into account for calculating the small angle CMB anisotropies,
because this profile of finite thickness smears out the small-scale three-dimensional
inhomogeneities as they are projected onto the two-dimensional celestial sphere. Figure 11 shows a
plot of the ionization history of the universe (not taking into account the late-time reionization
at $z \sim 6$ mentioned above). (For a more detailed discussion of the early ionization history of
the universe and the physics by which it is determined, see for example Refs. [S3, 202, 203, 220]
and [30].)
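
For orientation, the sharpness of the transition from an ionized to a neutral universe can be illustrated with the Saha equilibrium formula for pure hydrogen, $X_e^2/(1 - X_e) = n_H^{-1} (m_e k_B T / 2\pi\hbar^2)^{3/2} \exp(-13.6\,\mathrm{eV}/k_B T)$. This is a textbook equilibrium approximation and not the calculation performed by dedicated recombination codes, which follow the out-of-equilibrium atomic physics and yield a noticeably slower decline of the ionization fraction. The present-day hydrogen density, the CMB temperature, and the constants below are assumed values chosen for illustration.

```python
import math

K_B_EV_PER_K = 8.617333e-5           # Boltzmann constant in eV / K
ELECTRON_MASS_EV = 510998.95         # electron rest energy in eV
HBAR_C_EV_CM = 1.973270e-5           # hbar * c in eV cm
IONIZATION_ENERGY_EV = 13.6          # hydrogen ground-state binding energy
T_CMB_TODAY_K = 2.725                # assumed present-day CMB temperature
HYDROGEN_DENSITY_TODAY_CM3 = 1.9e-7  # assumed present-day hydrogen number density


def saha_ionization_fraction(z: float) -> float:
    """Equilibrium ionization fraction X_e of pure hydrogen at redshift z."""
    kt = K_B_EV_PER_K * T_CMB_TODAY_K * (1.0 + z)
    n_h = HYDROGEN_DENSITY_TODAY_CM3 * (1.0 + z) ** 3
    thermal = (ELECTRON_MASS_EV * kt / (2.0 * math.pi * HBAR_C_EV_CM ** 2)) ** 1.5
    s = thermal * math.exp(-IONIZATION_ENERGY_EV / kt) / n_h
    if s == 0.0:
        return 0.0
    # Positive root of X^2 + s X - s = 0, written in a numerically stable form.
    return 2.0 * s / (s + math.sqrt(s * s + 4.0 * s))


if __name__ == "__main__":
    for z in (1800.0, 1600.0, 1400.0, 1200.0, 1100.0):
        print(f"z = {z:6.0f}   X_e (Saha) = {saha_ionization_fraction(z):.4f}")
```

Even in this crude approximation the ionization fraction drops from nearly one to a small value over a modest range of redshift, which is what gives the last scattering surface its finite but narrow width.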

The first calculation of the CMB temperature anisotropies predicted in a universe with
linearized cosmological perturbations was given by Sachs and Wolfe in their classic 1967 paper
[200] (see also Ref. [154]). In their treatment, the LSS is idealized as the surface of a
three-dimensional sphere—in other words, the transition from tight-coupling to transparency
is idealized to be instantaneous. Locally, on this surface the photon-baryon fluid is subject to
two kinds of perturbations: (i) perturbations in density $\delta_\gamma$ and (ii) velocity perturbations $\mathbf{v}_\gamma$.
The former translate into fluctuations in the photon blackbody temperature $T_\gamma$, that is, into “intrinsic”
temperature fluctuations at the last scattering surface with $\delta T_\gamma/T_\gamma = \delta_\gamma/4$, and the latter translate
into a Doppler shift of the CMB temperature. If there were no metric perturbations, we would
simply have

$\frac{\delta T_f}{T_f}(\hat{\Omega}) = \frac{1}{4}\delta_\gamma - \mathbf{v}_\gamma \cdot \hat{\Omega}$, (21)

where $\hat{\Omega}$ is the unit outward normal on the last scattering surface. But there are also additional
terms due to the metric perturbations, and careful calculations along the perturbed geodesics
yield the following modification to the above equation:

$\frac{\delta T_f}{T_f}(\hat{\Omega}) = \frac{1}{4}\delta_\gamma - \mathbf{v}_\gamma \cdot \hat{\Omega} + \Phi + \int d\eta\, (\Phi' + \Psi')$, (22)


where the metric perturbations are parametrized as in Eq. (15) using conformal Newtonian
gauge.^i This equation is known as the Sachs-Wolfe formula. Because of the approximations
involved, the Sachs-Wolfe formula is not used for accurate calculations (especially on small


h The radius of this sphere in terms of today’s comoving units is larger than the age of the universe
(approximately 13.8 Gyr) converted into a distance owing to the expansion of the universe.

i The Sachs-Wolfe formula as given here only includes the “scalar” perturbations. The formula can be
generalized to include “vector” and “tensor” contributions, for which the only contributions are from the
Doppler and ISW (integrated Sachs-Wolfe) terms.
angular scales). However, it offers a good approximation to the CMB anisotropies on large
angular scales and provides invaluable intuition that is lacking in the more precise treatments. 

A few words are in order about gauge dependence,^j an issue that renders linearized perturbation
theory within the framework of general relativity somewhat messy. While the final observed
anisotropy $\delta T_f(\hat{\Omega})/T_f$ is the same in all coordinate systems (except for the monopole and dipole
terms), the attribution of the total anisotropy among the various terms (i.e., intrinsic temperature
fluctuation, Doppler, gravitational redshift, integrated Sachs-Wolfe) depends on the choice of
coordinates. We may for example choose coordinates so that the photon temperature serves as the
time coordinate, making the first term disappear, or alternatively, we may make the Doppler term
disappear by making our co-moving observers move with the local photon rest frame.

Students of cosmology are sometimes misled into believing the solution to these ambiguities is to
use conformal Newtonian gauge, or equivalently the “gauge invariant” formalism, which is
equivalent to transforming to conformal Newtonian gauge. It is believed that this gauge choice is
somehow more “physical” because at least
to linear order, the gauge conditions lead to a unique choice of gauge. It is certainly nice to
have a gauge condition leading to a unique choice of coordinates, but this uniqueness comes
at a price. The Newtonian gauge condition is “nonlocal” because of its reliance on the
decomposition of $h_{\mu\nu}$ into “scalar,” “vector,” and “tensor” components, which requires information
extending all the way out to spatial infinity. But no one has ever seen all the way out to
spatial infinity. For this reason, the Newtonian potentials $\Phi$ and $\Psi$ are unphysical because
they cannot be measured. Their determination would require information extending beyond
our horizon.



Figure 12: Theoretical CMB spectrum. CMB temperature power spectrum predicted for a model with only
the adiabatic growing mode excited, as in the standard
concordance cosmological model, is shown. The theoretical
model here assumes the best-fit cosmological parameters
taken from the Planck 2013 Results for Cosmology

una- 


We may further simplify Eq. (22) assuming that only the adiabatic growing mode has been
excited. Using the fact that $\delta T_\gamma$ and $\mathbf{v}$ are not independent of $\Phi$, we obtain the often cited


j “Gauge dependence” here refers to the behavior under general coordinate transformations, which,
consistent with the linear approximation above, may be truncated at linear order.
result

$\frac{\delta T}{T} = -\frac{\Phi}{3} - \mathbf{v} \cdot \hat{\Omega} + \int d\eta\, (\Phi' + \Psi')$, (23)


although the —1/3 factor is not quite right because a matter-dominated evolution for a(t) is 
assumed rather than a more careful treatment taking into account matter-radiation equality 
occurring at around $z_{\rm eq} \approx 3280$. (For a discussion see Ref. [255].)

Using this result and ignoring the integrated Sachs-Wolfe term, we obtain 


$\frac{\delta T}{T}(\hat{\Omega}) = -\frac{\Phi}{3} - \mathbf{v} \cdot \hat{\Omega}$. (24)


On large angular scales—that is, large compared to the angle subtended by the horizon on the 
last scattering surface—the first term dominates. The modes labeled by k obey an oscillatory 
system of coupled ODEs, and at the putative big bang each mode starts with a definite sharp 
phase corresponding to the part of the cycle where v = 0. It is only at horizon crossing that 
the phase has evolved sufficiently for the velocity term to contribute appreciably to the CMB
anisotropy. Figure 12 shows the calculated form of the CMB temperature power spectrum.
For $\ell \lesssim 100$, $\Delta T/T \approx -\Phi/3$ provides an adequate explanation for this leftmost plateau of the
curve, but at higher $\ell$ a rise to a first acoustic peak, situated at about $\ell \approx 220$, is observed
followed by a series of decaying secondary oscillations.

A more accurate integration of the evolution of the adiabatic mode up to last scattering can 
be obtained in the fluid approximation. This approximation assumes that the stress-energy 
content of the universe can be described by a fluid description consisting of two components 
coupled to each other only by gravity [205]. However more accurate calculations must go
beyond the fluid approximation using a Boltzmann formalism that includes higher order
moments.

However, before describing the Boltzmann formalism, capable of describing this
intermediate regime for the photons, we note one final shortcoming of the approximation with
tight-coupling and instantaneous recombination—namely, the treatment of polarization. We 
present here an approximate, heuristic treatment in order to provide intuition. 

The scattering of photons by electrons is polarization dependent, and this effect leads 
to a polarization of the CMB anisotropy when the fact that recombination does not occur 
instantaneously is taken into account. The Thomson scattering cross-section of a photon off 
an electron is given by 


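$\frac{d\sigma}{d\Omega} = \left(\frac{e^2}{m_e c^2}\right)^2 \left(\hat{\epsilon}_i \cdot \hat{\epsilon}_f\right)^2$, (25)

where $\hat{\epsilon}_i$ and $\hat{\epsilon}_f$ are the initial and final polarization vectors, respectively.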

To see qualitatively how polarization of the CMB comes about, let us for the moment 
assume that the photons coming from next-to-last scattering are unpolarized and calculate
the polarization introduced at last scattering, as indicated in Fig. 13. Measuring photons with 
one linear polarization selects photons that propagated from next-to-last to last scattering at 
a small angle to the axis of linear polarization, while photons with the other linear polarization 
tend to come from a direction from next-to-last to last scattering with a small angle with the 
other axis. Consequently, measuring the polarization amounts to measuring the temperature 
quadrupole as seen by the electron of last scattering. If we make the simplifying
assumption that the radiation emanating from next-to-last scattering is unpolarized, we obtain the
following expression for the Stokes parameters of the linear polarization,

$\begin{pmatrix} Q \\ U \end{pmatrix} \propto \int_{\rho=0}^{\rho_{\max}} d\left(-\exp[-\tau(r_{ls} + \rho)]\right) \int d\hat{\Omega}\, \sin^2\theta \begin{pmatrix} \cos 2\phi \\ \sin 2\phi \end{pmatrix} T(\rho \sin\theta \cos\phi,\ \rho \sin\theta \sin\phi,\ r_{ls} + \rho \cos\theta)$. (26)


Here we assume that the line of sight is along the z direction. The expression gives two of 
the five components of the quadrupole moment of the temperature as seen by the electron at last
scattering, and after integration along the line of sight would give the total linear polarization 
with the polarization at next-to-last scattering neglected. 
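
The claim that the polarization measures the local temperature quadrupole can be checked numerically. The Python sketch below integrates a temperature pattern $T(\theta, \phi)$ over the sphere against the kernel $\sin^2\theta\,(\cos 2\phi, \sin 2\phi)$, which is assumed here to be the angular structure of the Stokes parameters $Q$ and $U$ for a line of sight along $z$. The overall normalization, the sign convention, and the omission of the radial (visibility) integral are simplifications of this sketch.

```python
import math
from typing import Callable, Tuple


def stokes_from_temperature(
    temperature: Callable[[float, float], float],
    n_theta: int = 200,
    n_phi: int = 200,
) -> Tuple[float, float]:
    """Integrate sin^2(theta) (cos 2phi, sin 2phi) T(theta, phi) over the sphere.

    Uses the midpoint rule; returns unnormalized (Q, U).
    """
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    q = 0.0
    u = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        # sin^2 from the kernel times sin from the solid-angle element.
        weight = math.sin(theta) ** 3 * d_theta * d_phi
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            t = temperature(theta, phi)
            q += weight * math.cos(2.0 * phi) * t
            u += weight * math.sin(2.0 * phi) * t
    return q, u


def monopole(theta: float, phi: float) -> float:
    return 1.0


def dipole(theta: float, phi: float) -> float:
    return math.cos(theta)


def quadrupole(theta: float, phi: float) -> float:
    return math.sin(theta) ** 2 * math.cos(2.0 * phi)


if __name__ == "__main__":
    for name, pattern in (("monopole", monopole), ("dipole", dipole), ("quadrupole", quadrupole)):
        q, u = stokes_from_temperature(pattern)
        print(f"{name:10s}  Q = {q:+.4f}   U = {u:+.4f}")
```

A uniform temperature or a pure dipole along the line of sight produces no net polarization in this sketch, while a quadrupole with $m = \pm 2$ structure about the line of sight does.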


7 What Can We Learn From the CMB Temperature 
and Polarization Anisotropies? 

The previous Section showed how starting from Gaussian isotropic and homogeneous
initial conditions on superhorizon scales, the predicted CMB temperature and polarization
anisotropies are calculated, describing in detail all the relevant physical processes at play. In
this Section, we turn to the question of what we can learn about the universe from these
observations. We focus on how to exploit the temperature and polarization two-point functions,
which under the assumption of Gaussianity would summarize all the available information
characterizing the underlying stochastic process. Gaussianity is a hypothesis to be tested
using the data, as discussed separately in Sec. 9.

7.1 Character of primordial perturbations: adiabatic growing mode 
versus field ordering 

In the aftermath of the COBE/DMR announcement of the detection of the CMB tempera¬ 
ture anisotropy, two paradigms offered competing explanations for the origin of structure in 
the early universe. On the one hand, there was cosmic inflation, which in its simplest
incarnations predicts homogeneous and isotropic Gaussian initial perturbations where only the
adiabatic growing mode is excited. On the other hand, there was also another class of models
in which the universe is postulated initially to have been perfectly homogeneous and isotropic.
However, subsequently a symmetry breaking phase transition takes place, with the order
parameter field taking uncorrelated values in causally disconnected regions of spacetime. Then,
as the universe expands and the co-moving size of the horizon grows, the field aligns itself
over domains of increasingly large co-moving size, generally in a self-similar way. In many of
these models, topological defects of varying co-dimension—such as monopoles, cosmic strings,
and domain walls—arise after the phase transition, but textures where the spatial gradient
energy is more diffusely distributed are also possible.^k In these models the contribution of the
field ordering sector to the total energy is always subdominant, and cosmological
perturbations are generated in a continuous manner extending to the present time. The stress-energy

k For the physics of topological defects, see Ref. [188] and for a comprehensive account emphasizing the
connection to cosmology, see Ref. [226].
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Figure 14: Structure of acoustic oscillations: field ordering predictions. These acous¬ 
tic oscillation predictions from four field-ordering models are to be contrasted with the sharp, 
well-defined peaks in Fig. 12 as predicted by inflation. (Reprinted with permission from 
Ref. |155j .) ( Credit: Pen, Seljak and Turok ) 


$\Theta_{\mu\nu}$ from this sector sources perturbations in the metric perturbation $h_{\mu\nu}$.
These metric perturbations in turn generate perturbations in the dominant components
contributing to the stress-energy: the baryons, photons, neutrinos, and cold dark matter. In
these models, the cosmological perturbations are not primordial, but rather are continuously
generated so that the perturbations on the scale k are generated primarily as the mode k enters
the apparent horizon [156]. This property implies that the decaying mode as well as the growing
mode of the adiabatic perturbations is excited, and this fact has a spectacular effect on the
shape of the predicted CMB power spectrum. While inflationary models, which excite only the
growing mode, predict a series of sharp, well-defined acoustic oscillations, field ordering
models predict broad, washed out oscillations or no oscillations at all in the angular CMB
spectrum [36], [155]. Heuristically one can understand this behavior by arguing that the
positions of the peaks reflect the phase of the oscillations. Therefore, if both the growing and
decaying modes are present in exactly equal proportion, there should be no oscillations in the
angular spectrum. However, precise predictions require difficult numerical simulations and
depend on the precise model for the topological defect or field ordering sector. Figure 14 shows
the shape of the predicted CMB TT power spectrum for some field ordering models, to be compared
with the predictions for minimal inflation shown in Fig. 12.

After COBE, a big question was whether improved degree-scale CMB observations would
unveil the first Doppler peak followed by a succession of decaying secondary peaks at smaller
angular scales, as predicted by an approximately scale invariant primordial power spectrum
for the growing adiabatic mode, or whether some other shape would be observed, perhaps the
one predicted by a field ordering model.

We already presented (in Sec. 1) the experimental results for the first observations of the
acoustic oscillations and their subsequent precise mapping (see in particular Figs. 5 and 10).
The data clearly favor simple models of inflation producing adiabatic growing mode
perturbations with an approximately scale invariant power spectrum and exclude scenarios where
topological defects serve as the primary source for the initial density perturbations.
However, models including a small admixture of defect induced perturbations cannot be ruled
out [?].

7.2 Boltzmann hierarchy evolution

In the previous section, we derived the Sachs-Wolfe formula, which provides an intuitive
understanding of how the CMB anisotropies are imprinted. The Sachs-Wolfe treatment provides an
approximate calculation of the CMB temperature anisotropies, correctly capturing their
qualitative features. However, to calculate predictions accurate at the sub-percent level, as
is required for confronting current observations with theoretical models, the transition from
the tight-coupling regime to transparency, or the free-streaming regime, must be modeled more
realistically. This requires going beyond the fluid approximation, where the photon gas can be
described by its first few moments. Modeling this transition faithfully requires a formalism
with an infinite number of moments, that is, a formalism in which the fundamental dynamical
variable is the photon phase space distribution function
$f(\mathbf{x}, t, \nu, \hat{n}, \hat{\epsilon})$, where $\nu$ is the photon frequency, and the
unit vectors $\hat{n}$ and $\hat{\epsilon}$ denote the photon propagation direction and electric
field polarization, respectively. The perturbations about a perfect blackbody spectrum are
small, so truncating at linear order provides an adequate approximation. Moreover, Thomson
scattering is independent of frequency, and thus (to linear order) does not alter the blackbody
character of the spectrum. Consequently, a description in terms of a perturbation of the
blackbody temperature that depends on spacetime position, propagation direction, and
polarization direction can be used in place of the full six-dimensional phase space density,
thus reducing the number of dimensions of the phase space by one.$^l$ For a calculation correct
to second order, a description in terms of a perturbed temperature would not suffice.

Figure 13: Anisotropy of polarized Thomson scattering and origin of the CMB polarization.
We show two examples of photon scattering from (NLS) (next-to-last scattering) to (LS) (last
scattering) and finally to the observer (at O). The three segments in the diagram have been
chosen mutually at right angles in order to maximize the effect, and for simplicity we assume
that the radiation emanating from (NLS)$_1$ and (NLS)$_2$ is unpolarized. All the radiation that
scatters coming from (NLS)$_1$ to (LS) and then towards O is completely linearly polarized in
the $\hat{e}_2$ direction, because the other polarization is parallel to (LS)O and thus cannot
be scattered in that direction. Likewise, the scattered photons emanating from (NLS)$_2$ are
completely linearly polarized in the orthogonal direction for the same reason. Consequently, if
there were only two sources as above, measuring the Stokes parameter Q at O amounts to
measuring the difference in intensity between these two sources.

As before we consider a single “scalar” Fourier mode $\mathbf{k}$ of flat three-dimensional
space, so that an $\exp[i\mathbf{k}\cdot\mathbf{x}]$ spatial dependence is implied. A
complication in describing polarization arises in specifying a basis, which preferably is
accomplished using only scalar quantities. We describe the polarization using the unit vectors
$\hat{n}$ and $\hat{k}$ to define the unit vectors

$$\hat{\theta} = \frac{\hat{k} - \hat{n}(\hat{n}\cdot\hat{k})}{\|\hat{k} - \hat{n}(\hat{n}\cdot\hat{k})\|}, \qquad
\hat{\phi} = \frac{\hat{n}\times\hat{k}}{\|\hat{n}\times\hat{k}\|}, \tag{27}$$

which together with the photon propagation direction $\hat{n}$ form an orthonormal basis. We thus
describe the Stokes parameters of the radiation propagating along $\hat{n}$ as follows:

$$\langle \mathcal{E}_a(\hat{n},\nu)\,\mathcal{E}_b(\hat{n},\nu)\rangle
= I(\hat{n},\nu;\mathbf{k})\,\delta_{ab}
+ Q(\hat{n},\nu;\mathbf{k})\,[\hat{\theta}\otimes\hat{\theta} - \hat{\phi}\otimes\hat{\phi}]_{ab}. \tag{28}$$

For the “scalar” mode, the Stokes parameters U and V vanish in this basis.

The Thomson scattering cross-section does not vary with frequency, a property that provides
substantial simplification. Suppose that at some initial time the photon distribution
function has a blackbody dependence on frequency, with a blackbody temperature suffering
only small linearized perturbations about an average temperature, and that these temperature
perturbations depend on the spacetime position, propagation direction, and linear polarization
direction. The frequency independence of Thomson scattering ensures that a photon
distribution function initially having this special form will retain this special form during its
subsequent time evolution. Initially, when tight-coupling is an excellent approximation, any
deviation in the frequency dependence from a perfect blackbody spectrum decays rapidly.
Moreover, any initial polarization is erased by the frequent scattering. Therefore a blackbody
form described by a perturbation in the spectral radiance of the form

$$\delta I_\nu(\hat{n},\hat{\epsilon},\mathbf{x},t) = T_\gamma\,\frac{\partial B_\nu(T)}{\partial T}\,
\Delta_{ab}(\hat{n},\mathbf{x},t)\,\epsilon_a\epsilon_b \tag{29}$$

is well motivated. Here

$$B_\nu(T) = \frac{2h\nu^3}{c^2}\,\frac{1}{\exp(h\nu/k_B T) - 1} \tag{30}$$

is the spectral radiance for a blackbody at temperature T. The unperturbed spectral radiance
is $I_{\nu,0}(\hat{n},\hat{\epsilon},\mathbf{x},t) = B_\nu(T_\gamma(t))$, where $T_\gamma(t)$ is
the photon temperature at cosmic time t in the unperturbed background expanding FLRW spacetime.
(In Sec. 5, however, we discuss some interesting possible caveats to this assumption.) For the
“scalar” mode of wavenumber $\mathbf{k}$, we may decompose

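$$\Delta_{ab}(\hat{n},\mathbf{x},t) = \Delta_T^{(\mathrm{scl})}(\hat{n},\mathbf{x},t)\,\delta_{ab}
+ \Delta_P^{(\mathrm{scl})}(\hat{n},\mathbf{x},t)\,(\hat{\theta}_a\hat{\theta}_b - \hat{\phi}_a\hat{\phi}_b). \tag{31}$$

$^l$ Although written as a continuum variable, $\hat{\epsilon}$ is in reality a discrete variable
corresponding to the Stokes parameters (I, Q and U).

We now turn to the evolution of the variables $\Delta_T^{(\mathrm{scl})}(\hat{n}, t)$ and
$\Delta_P^{(\mathrm{scl})}(\hat{n}, t)$. It is convenient, exploiting the symmetry under rotations
about $\hat{k}$, to expand

$$\begin{aligned}
\Delta_T^{(\mathrm{scl})}(\hat{n}, t; \mathbf{k}) &= \sum_{\ell=0}^{\infty} \Delta_{T\ell}^{(\mathrm{scl})}(t; \mathbf{k})\,(-i)^\ell\,(2\ell+1)\,P_\ell(\hat{k}\cdot\hat{n}),\\
\Delta_P^{(\mathrm{scl})}(\hat{n}, t; \mathbf{k}) &= \sum_{\ell=0}^{\infty} \Delta_{P\ell}^{(\mathrm{scl})}(t; \mathbf{k})\,(-i)^\ell\,(2\ell+1)\,P_\ell(\hat{k}\cdot\hat{n}).
\end{aligned} \tag{32}$$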


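The evolution of $\Delta_{T\ell}^{(\mathrm{scl})}$ and $\Delta_{P\ell}^{(\mathrm{scl})}$ is governed
by the following infinite system of equations

$$\begin{aligned}
\dot{\Delta}_{T0}^{(\mathrm{scl})} &= -k\,\Delta_{T1}^{(\mathrm{scl})} + \dot{\phi},\\
\dot{\Delta}_{T1}^{(\mathrm{scl})} &= \frac{k}{3}\left(\Delta_{T0}^{(\mathrm{scl})} - 2\Delta_{T2}^{(\mathrm{scl})} + \psi\right)
 + a(t)\,n_e(t)\,\sigma_T\left(\frac{v_b}{3} - \Delta_{T1}^{(\mathrm{scl})}\right),\\
\dot{\Delta}_{T2}^{(\mathrm{scl})} &= \frac{k}{5}\left(2\Delta_{T1}^{(\mathrm{scl})} - 3\Delta_{T3}^{(\mathrm{scl})}\right)
 + a(t)\,n_e(t)\,\sigma_T\left(\frac{\Pi}{10} - \Delta_{T2}^{(\mathrm{scl})}\right),\\
\dot{\Delta}_{T\ell}^{(\mathrm{scl})} &= \frac{k}{2\ell+1}\left\{\ell\,\Delta_{T(\ell-1)}^{(\mathrm{scl})} - (\ell+1)\,\Delta_{T(\ell+1)}^{(\mathrm{scl})}\right\}
 - a(t)\,n_e(t)\,\sigma_T\,\Delta_{T\ell}^{(\mathrm{scl})}, \quad \text{for } \ell \ge 3,\\
\dot{\Delta}_{P\ell}^{(\mathrm{scl})} &= \frac{k}{2\ell+1}\left\{\ell\,\Delta_{P(\ell-1)}^{(\mathrm{scl})} - (\ell+1)\,\Delta_{P(\ell+1)}^{(\mathrm{scl})}\right\}
 - a(t)\,n_e(t)\,\sigma_T\,\Delta_{P\ell}^{(\mathrm{scl})}\\
 &\quad + \frac{1}{2}\,a(t)\,n_e(t)\,\sigma_T\,\Pi\left(\delta_{\ell,0} + \frac{1}{5}\delta_{\ell,2}\right), \quad \text{for } \ell \ge 0,
\end{aligned} \tag{33}$$

where

$$\Pi = \Delta_{T2}^{(\mathrm{scl})} + \Delta_{P2}^{(\mathrm{scl})} + \Delta_{P0}^{(\mathrm{scl})}. \tag{34}$$

These equations are known as the Boltzmann hierarchy for the photon phase space distribution
function. In the last line of Eq. (33) $\delta_{ab}$ denotes the Kronecker delta function. For
practical numerical calculations, this infinite set of coupled equations must be truncated at
some $\ell_{\max}$.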
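To make the truncation concrete, the following is a minimal sketch (not the method of any particular CMB code) that integrates only the free-streaming part of the hierarchy, that is, the $\ell \ge 3$ recursion of Eq. (33) applied to all $\ell$ with the scattering and metric source terms switched off. In that limit, with the initial condition $\Delta_0 = 1$ and all higher moments zero, the exact solution is $\Delta_\ell(\eta) = j_\ell(k\eta)$, which gives a simple check. The function names, the closure $\Delta_{\ell_{\max}+1} = 0$, and the fixed-step Runge-Kutta integrator are illustrative assumptions.

```python
import math

def free_streaming_rhs(delta, k):
    """d(Delta_l)/d(eta) = k/(2l+1) * [l*Delta_{l-1} - (l+1)*Delta_{l+1}], no scattering."""
    lmax = len(delta) - 1
    rhs = []
    for l in range(lmax + 1):
        lower = delta[l - 1] if l > 0 else 0.0
        # Assumed closure: the first moment beyond lmax is simply set to zero.
        upper = delta[l + 1] if l < lmax else 0.0
        rhs.append(k / (2 * l + 1) * (l * lower - (l + 1) * upper))
    return rhs

def integrate_free_streaming(k, eta_end, lmax=30, steps=2000):
    """Fixed-step RK4 integration from eta = 0 with Delta_0 = 1 and Delta_l = 0 for l > 0."""
    delta = [0.0] * (lmax + 1)
    delta[0] = 1.0
    h = eta_end / steps
    for _ in range(steps):
        k1 = free_streaming_rhs(delta, k)
        k2 = free_streaming_rhs([d + 0.5 * h * s for d, s in zip(delta, k1)], k)
        k3 = free_streaming_rhs([d + 0.5 * h * s for d, s in zip(delta, k2)], k)
        k4 = free_streaming_rhs([d + h * s for d, s in zip(delta, k3)], k)
        delta = [d + h / 6.0 * (a + 2 * b + 2 * c + e)
                 for d, a, b, c, e in zip(delta, k1, k2, k3, k4)]
    return delta

if __name__ == "__main__":
    moments = integrate_free_streaming(k=1.0, eta_end=5.0)
    print("Delta_0:", moments[0], " j_0(5):", math.sin(5.0) / 5.0)
```

The crude closure is harmless here only because $k\eta$ stays well below $\ell_{\max}$; once power reaches the highest retained multipole it reflects back down the hierarchy, which is one reason production codes use more careful truncation schemes.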

In principle, ignoring questions of computational efficiency, we could truncate the system
of equations in (33) at some large $\ell_{\max}$, sufficiently above the maximum multipole number
of interest today, and integrate the coupled system from some sufficiently early initial time
to the present time $t_0$ to find the “scalar” temperature and polarization anisotropies today,
which would be given by the following integrals over plane wave modes:

$$\begin{aligned}
\Delta_T^{(\mathrm{scl})}(\hat{n}, t_0) &= \int d^3k\, \Delta_T^{(\mathrm{scl})}(\hat{n}, t_0; \mathbf{k}),\\
\Delta_P^{(\mathrm{scl})}(\hat{n}, t_0) &= \int d^3k\, \Delta_P^{(\mathrm{scl})}(\hat{n}, t_0; \mathbf{k}).
\end{aligned} \tag{35}$$

This was the approach used in the COSMICS code [16], one of the first codes providing
accurate computations of the CMB angular power spectrum. However, in this approach much
computational effort is expended at late times when there is virtually no scattering.

A computationally simpler but mathematically completely equivalent approach known as
the line of sight formalism was introduced in Ref. [206]. The crucial idea is to express the
anisotropies today (at $\mathbf{x} = 0$ and $\eta = \eta_0$) in terms of a line of sight integral,
so that

$$\Delta_T^{(S)}(\hat{n}) = \int_0^{\eta_0} d\eta\, \exp[-\tau(\eta)]\,\exp[ik\mu(\eta - \eta_0)]
\left[\left(-\frac{d\tau}{d\eta}\right)\left(\Delta_{T0} + i\mu v_b + \frac{1}{2}P_2(\mu)\,\Pi\right)
+ (\phi' - ik\mu\psi)\right] \tag{37}$$

and

$$\Delta_P^{(S)}(\hat{n}) = \int_0^{\eta_0} d\eta\, \exp[-\tau(\eta)]\,\exp[ik\mu(\eta - \eta_0)]
\left(-\frac{d\tau}{d\eta}\right)\frac{1}{2}\left(1 - P_2(\mu)\right)\Pi, \tag{38}$$

where $\Pi = \Delta_{T2}^{(S)} + \Delta_{P2}^{(S)} + \Delta_{P0}^{(S)}$. We may think of
$\exp[-\tau(\eta)](-d\tau/d\eta)\,d\eta = V(\eta)\,d\eta$ as a measure along the line of sight
weighted according to where last scattering takes place. We shall call $V(\eta)$ the visibility
function. In Eq. (37), the first term represents the nonpolarized contribution at last scatter
and the second term represents the contribution from the integrated Sachs-Wolfe effect. In
Eq. (38) for the polarized anisotropy, the only contribution is from last scatter. It is
convenient to rewrite the above integrals using integration by parts so that all spatial
derivatives along the line of sight and occurrences of $\mu$ are eliminated, and in the
process we also omit the monopole surface term at the endpoint corresponding to the observer
as this contribution is not measurable. After this integration by parts, the integrals above
take the form


$$\Delta_{T,P}^{(S)}(\hat{n}) = \int_0^{\eta_0} d\eta\, \exp[ik\mu(\eta_0 - \eta)]\, S_{T,P}^{(S)}(k, \eta), \tag{39}$$

where the scalar source functions are


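$$\begin{aligned}
S_T^{(S)}(k, \eta) &= \exp[-\tau(\eta)]\,(\phi' + \psi')
 + V\left(\Delta_{T0} + \psi + \frac{v_b'}{k} + \frac{\Pi}{4} + \frac{3\Pi''}{4k^2}\right)\\
 &\quad + V'\left(\frac{v_b}{k} + \frac{3\Pi'}{2k^2}\right) + V''\,\frac{3\Pi}{4k^2},\\
S_P^{(S)}(k, \eta) &= -\frac{3}{4k^2}\left(V(k^2\Pi + \Pi'') + 2V'\Pi' + V''\Pi\right),
\end{aligned} \tag{40}$$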


and the primes denote derivatives with respect to conformal time $\eta$. Now that the spatial
derivatives have been eliminated, it is no longer necessary to use the plane waves
$\exp[ik\mu(\eta_0 - \eta)]$ as the eigenfunctions of the Laplacian operator with eigenvalue
$-k^2$. It is convenient instead to use a spherical wave expansion with $j_\ell(k(\eta_0 - \eta))$:

$$\Delta_{A\ell}^{(S)}(k) = \int_0^{\eta_0} d\eta\, j_\ell(k(\eta_0 - \eta))\, S_A^{(S)}(k, \eta), \tag{41}$$

where A = T, P.

The line of sight formulation results in a significant reduction of the computational effort
required for computing the high-$\ell$ coefficients for two reasons. First, the Boltzmann
hierarchy can be truncated at low $\ell$, because only moments up to $\ell = 2$ appear in the
integral. Second, when the universe has become transparent, there is no longer any
contribution from the Thomson scattering.

We express the initial state for the growing adiabatic mode at some early reference time
whose exact value must be chosen after the epoch of entropy generation following inflation
but before any of the relevant modes re-entered the horizon. We use the variable
$\zeta(\mathbf{x})$, whose value is conserved on superhorizon scales, to characterize the
primordial perturbations in the adiabatic growing mode. We may expand, either using plane wave
Fourier modes or a spherical wave expansion, so that

$$\zeta(\mathbf{x}) = \int d^3k\, \zeta(\mathbf{k})\, \exp[i\mathbf{k}\cdot\mathbf{x}] \tag{42}$$

or

$$\zeta(\mathbf{x}) = \sum_{\ell=0}^{\infty}\sum_{m=-\ell}^{+\ell}\int_0^\infty dk\, \zeta_{\ell m}(k)\, j_\ell(kr)\, Y_{\ell m}(\theta, \phi). \tag{43}$$

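We characterize the homogeneous and isotropic Gaussian statistical ensemble for
$\zeta(\mathbf{x})$ by means of a power spectrum $\mathcal{P}(k)$, which suffices to completely
define the statistical process generating $\zeta(\mathbf{x})$. We define $\mathcal{P}(k)$
according to the following expectation values:

$$\langle \zeta(\mathbf{k})\,\zeta(\mathbf{k}')\rangle = (2\pi)^3\,\delta^3(\mathbf{k} - \mathbf{k}')\,\mathcal{P}(k), \tag{44}$$

or equivalently

$$\langle \zeta_{\ell m}(k)\,\zeta_{\ell' m'}(k')\rangle = \delta_{\ell,\ell'}\,\delta_{m,m'}\,\delta(k - k')\,\mathcal{P}(k). \tag{45}$$

It follows that

$$a_{\ell m}^{(S),T} = \int_0^\infty dk \int_0^{\eta_0} d\eta\, j_\ell(k(\eta_0 - \eta))\, S_T^{(S)}(\eta, k)\, \zeta_{\ell m}(k), \tag{46}$$

so that

$$C_\ell^{(S),TT} = \langle [a_{\ell m}^{(S),T}]^{*}\, a_{\ell m}^{(S),T}\rangle
= \int_0^\infty dk\, \mathcal{P}(k)\, \Delta_\ell^{(S),T}(k)\, \Delta_\ell^{(S),T}(k), \tag{47}$$

where

$$\Delta_\ell^{(S),T}(k) = \int_0^{\eta_0} d\eta\, j_\ell[k(\eta_0 - \eta)]\, S_T^{(S)}(\eta, k). \tag{48}$$

The above approach was first implemented in the publicly available code CMBFAST [206] and
later in the code CAMB [?].$^m$ More details about the computational issues may be found
in these papers.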


$^m$ For a download and information on CAMB, see www.camb.info.

7.3 Angular diameter distance

The radius of the last scattering surface expressed in terms of co-moving units today (for
example in light-years or Mpc) is given by the integral

$$x_{ls} = H_0^{-1}\int_{a_{ls}}^{1} \frac{da}{a^2\sqrt{\Omega_r a^{-4} + \Omega_m a^{-3} + \Omega_\Lambda}} \tag{49}$$

for a universe whose stress-energy content includes radiation, matter, and a cosmological
constant. Here the scale factor is given by $a_{ls} = (z_{ls} + 1)^{-1}$. For other equations of
state the argument of the square root above is modified accordingly. A physical scale d at last
scattering is converted to a co-moving scale today x according to $x = (z_{ls} + 1)\,d$, and under
the assumption of a flat spatial geometry, the angle in radians subtended by an arc of length
d on the last scattering sphere is $\theta = d/d_{ls}$.
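As a numerical illustration of Eq. (49), the sketch below evaluates the integral with a composite Simpson rule and converts the result to Mpc by restoring the factor of c. The parameter values used in the example call ($H_0$ = 67.7 km/s/Mpc, $\Omega_m$ = 0.31, $\Omega_r$ = 9.2e-5, $z_{ls}$ = 1090) are assumptions chosen for illustration and are not taken from the text; the function name is likewise hypothetical.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s

def comoving_distance_to_last_scattering(h0_km_s_mpc, omega_r, omega_m, omega_lambda,
                                         z_ls, steps=20000):
    """Evaluate Eq. (49) in Mpc using composite Simpson's rule (steps must be even)."""
    a_ls = 1.0 / (1.0 + z_ls)

    def integrand(a):
        # 1 / (a^2 * sqrt(Omega_r a^-4 + Omega_m a^-3 + Omega_Lambda))
        return 1.0 / (a * a * math.sqrt(omega_r * a ** -4 + omega_m * a ** -3 + omega_lambda))

    h = (1.0 - a_ls) / steps
    total = integrand(a_ls) + integrand(1.0)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * integrand(a_ls + i * h)
    hubble_length_mpc = C_KM_S / h0_km_s_mpc  # c / H_0 in Mpc
    return hubble_length_mpc * total * h / 3.0

if __name__ == "__main__":
    omega_m, omega_r = 0.31, 9.2e-5          # assumed illustrative values
    omega_lambda = 1.0 - omega_m - omega_r   # flat universe
    x_ls = comoving_distance_to_last_scattering(67.7, omega_r, omega_m, omega_lambda, 1090.0)
    print(f"x_ls = {x_ls:.0f} Mpc")
```

For these assumed parameters the result is of order a few Hubble lengths $c/H_0$, which is the scale that sets the conversion between physical sizes at last scattering and observed angles.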


Except for the integrated Sachs-Wolfe effect (discussed below in Sec. 7.4), which is very
hard to measure because the S/N is negligible but for the very first multipoles, the projection
effect described above is the only way in which late-time physics enters into determining the
angular power spectrum. Possible spatial curvature, parametrized through $\Omega_k$, the details
of a possible quintessence field, or similar new physics that alters the expansion history at
late times enters into the CMB power spectra only through a rescaling of the relation between
angular scales, on the one hand, and physical scales, on the other, around last scattering, and
this rescaling can be encapsulated into a single variable $d_{ls}$.

If the family of FLRW solutions is extended to admit spatial manifolds of constant positive
or negative curvature, the expression for the angular size must be modified to the following
for a negatively curved (hyperbolic) universe

$$\theta = \frac{\Omega_k^{1/2}\, d_{ls}}{\sinh[\Omega_k^{1/2}\, d_{ls}]}\,\frac{d}{d_{ls}} \tag{50}$$

or in the case of positive curvature (i.e., a spherical universe)

$$\theta = \frac{(-\Omega_k)^{1/2}\, d_{ls}}{\sin[(-\Omega_k)^{1/2}\, d_{ls}]}\,\frac{d}{d_{ls}}. \tag{51}$$

In the former case, the additional prefactor giving the contribution for the non-Euclidean
character of the geometry has a demagnifying effect and increases the angular diameter distance,
whereas for the spherical case the effect is the opposite.
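The size of the curvature prefactor in Eqs. (50) and (51) can be checked with a few lines of code. This is a sketch under stated assumptions: the distance argument `chi` is taken to be expressed in units of the Hubble length so that $\Omega_k^{1/2}\chi$ is dimensionless, the sign convention follows the text ($\Omega_k > 0$ hyperbolic, $\Omega_k < 0$ spherical), and the function name is hypothetical.

```python
import math

def curvature_prefactor(omega_k, chi):
    """Factor multiplying d/d_ls in Eqs. (50)-(51); chi is assumed to be in units of c/H_0."""
    x = math.sqrt(abs(omega_k)) * chi
    if x < 1e-8:
        return 1.0                # flat limit
    if omega_k > 0.0:
        return x / math.sinh(x)   # hyperbolic: factor < 1, objects appear smaller
    return x / math.sin(x)        # spherical: factor > 1 (valid for x < pi)

if __name__ == "__main__":
    chi = 3.2  # assumed illustrative distance in Hubble lengths
    for omega_k in (0.01, 0.0, -0.01):
        print(omega_k, curvature_prefactor(omega_k, chi))
```

For small curvature the factor behaves as $1 \mp x^2/6$ with $x = |\Omega_k|^{1/2}\chi$, so a percent-level $|\Omega_k|$ shifts the angular scale of the acoustic peaks at roughly the percent level for distances of a few Hubble lengths.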

In the early 1990s, many theorists working in early universe cosmology fervently believed
on grounds of simplicity in a cosmological model with $\Omega_k = 0$ (supposedly an inexorable
prediction of inflation) and no cosmological constant (regarded as extremely finely tuned
and “unnatural”) containing only matter and radiation so that $\Omega_r + \Omega_m = 1$ exactly.
This belief was maintained despite a number of discordant astronomical observations. Many in the
astronomical community, however, rather adopted the attitude “I believe only what I see,” and
thus favored some kind of a low-density universe, either a negatively curved universe where
$\Omega_k = 1 - \Omega_m - \Omega_r$ or a universe with a nonzero cosmological constant, where
$\Omega_\Lambda = 1 - \Omega_m - \Omega_r$.$^n$

Although it was widely claimed that spatial flatness was a prediction of inflation, the
basic idea of how to produce a negatively curved universe within the framework of inflation
had already been proposed in 1982 by Gott and Statler [?]. However, since they had
not calculated the perturbations predicted in such a model, it was not clear that such a
model would work. In 1995 Bucher et al. [?] and Yamamoto et al. [?] calculated
the perturbations for single bubble open inflation, showing them to be consistent with all
the observations available at the time. Present constraints from the CMB limit $|\Omega_k|$ to a
few percent at most, the exact number depending on the precise parametric model assumed [?].

$^n$ For a snapshot of the debates in cosmology in the mid-1990s, see the proceedings of the
conference Critical dialogues in cosmology held in 1995 at Princeton University [222].

7.4 Integrated Sachs-Wolfe effect


The discussion above claimed that the impact of late-time physics on the CMB power spectrum
could be completely encapsulated into a single parameter: the angular diameter distance to
last scattering $d_{ls}$. This is only approximately true, because the geometric argument above
assumed that all the CMB anisotropy can be localized on the last scattering surface and
ignored the integrated Sachs-Wolfe term in Eq. (22), which gives the part of the linearized
CMB anisotropy imprinted after last scattering.

Intuitively, the integrated Sachs-Wolfe term can be understood as follows. The Newtonian
gravitational potential blueshifts the CMB photons as they fall into potential wells and
similarly redshifts them as they again climb out. If the depth of the potential well does not
change with time, the two effects cancel. In this case, there is no integrated Sachs-Wolfe
contribution because the integral can be reduced to the sum of two surface terms. But if the
depth of the potential well changes with time, in particular if the overall scale of the
potential is decaying, the two effects no longer cancel and an integrated Sachs-Wolfe
contribution is imprinted.

In the linear theory, the gravitational potential at late times may be factorized in the
following way:

$$\Phi(\mathbf{x}, t) = \Phi(\mathbf{x})\,T(t), \tag{52}$$

and we may normalize T(t) to one when the universe has become matter dominated, but
before the cosmological constant (or some other dark energy sector) has started to alter the
expansion history.

From the CMB anisotropies alone, it is hard to determine the contribution of the integrated
Sachs-Wolfe effect to the total CMB anisotropy on large angular scales. However, one can try to
isolate its contribution by measuring the cross-correlation of the CMB temperature anisotropy
with the large-scale structure using a broad radial window function extending to large redshifts,
as was first proposed in Ref. [?], whose aim was to find evidence for a nonzero cosmological
constant.

The evolution equation for the linearized density contrast of a single pressureless matter
component is [151]

$$\ddot{\delta}_m(t) + 2H(t)\,\dot{\delta}_m(t) - \frac{3}{2}H^2(t)\,\Omega_m(t)\,\delta_m(t) = 0, \tag{53}$$

where the dots denote derivatives with respect to proper time and $H(t) = \dot{a}(t)/a(t)$. Here
we treat the baryons and the CDM (cold dark matter) as a single component ignoring any
relative velocity between them. This is not a bad approximation at late times, particularly
on very large scales.

An Einstein-de Sitter universe (for which $\Omega_r$ is negligible and $\Omega_\Lambda = 0$) is a
good approximation for our universe at intermediate redshifts, because for
$z \lesssim z_{eq} \approx 3.4 \times 10^3$ radiation is unimportant, and after the decoupling of
the photons and baryons it is valid to set the velocity of sound equal to zero as above. Let
$z_{m\Lambda} = 1/a_{m\Lambda} - 1$ be the redshift where the nonrelativistic matter and the
putative cosmological constant contribute equally to the mean density of the universe, so that
$\Omega_m = a_{m\Lambda}^3/(1 + a_{m\Lambda}^3)$ and $\Omega_\Lambda = 1/(1 + a_{m\Lambda}^3)$.
For $\Omega_\Lambda = 0.68$, for example, we would have $z_{m\Lambda} = 0.3$ and
$a_{m\Lambda} = 0.78$. For the Einstein-de Sitter case, $a(t) \propto t^{2/3}$ and the growing and
decaying solutions to Eq. (53) are $t^{2/3}$ and $t^{-1}$, respectively. The Poisson equation
reads $(1/a^2)\nabla^2\Phi = (3/2)H^2\Omega_m\delta_m$, and for the growing mode
$\Phi(\mathbf{x})$ is independent of time, implying that there is no integrated Sachs-Wolfe
contribution at late times. However, as the cosmological constant begins to take over and
$\Omega_\Lambda$ starts to rise toward one, $\Phi$ starts to decay, as sketched in Fig. 15.
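The decay of the potential can be illustrated by integrating Eq. (53) numerically. The sketch below assumes a spatially flat universe containing only pressureless matter and a cosmological constant (radiation neglected), rewrites Eq. (53) with $\ln a$ as the time variable, $\delta'' + (2 - \tfrac{3}{2}\Omega_m(a))\,\delta' - \tfrac{3}{2}\Omega_m(a)\,\delta = 0$, and starts deep in matter domination on the growing mode $\delta = a$. Since the Poisson equation gives $\Phi \propto \delta_m/a$ for such a universe, the returned value $\delta(a=1)$ equals T today in the normalization of Eq. (52). Function names and the starting scale factor are illustrative assumptions.

```python
import math

def equality_scale_factor(omega_lambda0):
    """Scale factor at which matter and Lambda densities are equal (flat, no radiation)."""
    return ((1.0 - omega_lambda0) / omega_lambda0) ** (1.0 / 3.0)

def potential_today(omega_lambda0, a_start=1e-3, steps=4000):
    """Return T(a=1) = delta(a=1), with delta = a (so T = 1) at a_start; RK4 in ln(a)."""
    omega_m0 = 1.0 - omega_lambda0

    def rhs(lna, delta, ddelta):
        a = math.exp(lna)
        matter = omega_m0 * a ** -3
        om = matter / (matter + omega_lambda0)   # Omega_m(a)
        return ddelta, -(2.0 - 1.5 * om) * ddelta + 1.5 * om * delta

    lna = math.log(a_start)
    h = -lna / steps
    delta, ddelta = a_start, a_start             # growing mode of matter domination
    for _ in range(steps):
        k1 = rhs(lna, delta, ddelta)
        k2 = rhs(lna + h / 2, delta + h / 2 * k1[0], ddelta + h / 2 * k1[1])
        k3 = rhs(lna + h / 2, delta + h / 2 * k2[0], ddelta + h / 2 * k2[1])
        k4 = rhs(lna + h, delta + h * k3[0], ddelta + h * k3[1])
        delta += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        ddelta += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        lna += h
    return delta

if __name__ == "__main__":
    a_eq = equality_scale_factor(0.68)
    print("a_mLambda:", a_eq, " z_mLambda:", 1.0 / a_eq - 1.0)
    for omega_lambda0 in (0.0, 0.6, 0.7):
        print(omega_lambda0, potential_today(omega_lambda0))
```

With $\Omega_\Lambda = 0$ the integration returns one, reproducing the statement that the potential is constant in the Einstein-de Sitter case, while any nonzero $\Omega_\Lambda$ gives a value below one.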


As explained in detail in Ref. [37], the maximum signal-to-noise (S/N) that can be extracted
from the ISW correlation, even with an ideal measurement of the large-scale structure
anisotropy within our past light cone, is modest because of the “noise” from the primary CMB
anisotropies emanating from the last scattering surface. This cosmic variance noise cuts off
the available S/N because on small angular scales, the relative contribution from ISW
plummets. The S/N is centered around $\ell \approx 20$ and integrates to about S/N ≈ 6.5
(assuming $\Omega_{\Lambda 0} = 0.7$). For an early detection of a correlation, see Ref. [55],
and for the most recent work in this area see Ref. [?] and references therein.

7.5 Reionization 

According to our best current understanding of the ionization history of the universe, the
early ionization fraction around recombination is accurately modeled assuming homogeneity and
using atomic rate and radiative transfer equations. As a rough estimate, one may assume the
ionization fraction predicted by equilibrium thermodynamics given by the Saha equation, in
which there is exponential sensitivity to changes in temperature. After $x \approx 1/2$, which
occurs at about $z \approx 1270$, the ionization fraction x(z) rapidly decays to almost zero,
except for trace amounts of ionized hydrogen (denoted as HII in the astrophysical
literature) and of free electrons, due to the inefficiency of recombination in the presence of
trace concentrations. If it were not for inhomogeneities and structure formation, this nearly
perfect neutral phase would persist to the present day. However, the initially small primordial
inhomogeneities grow and subsequently evolve nonlinearly. The first highly nonlinear
gravitationally collapsed regions give rise to the first generation of stars and quasars, which
serve as a source of UV radiation that reionizes the neutral hydrogen almost completely, except
in a minority of regions of extremely high gas density, where recombination occurs sufficiently
frequently to counteract the ionizing flux of UV photons.
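The exponential sensitivity of the Saha estimate to temperature can be seen in a short calculation. The sketch below treats hydrogen only, $x^2/(1-x) = n_{\rm H}^{-1}(m_e k_B T/2\pi\hbar^2)^{3/2}\exp(-B/k_B T)$, with $n_{\rm H}$ written in terms of the photon number density. The baryon-to-photon ratio, helium mass fraction, and present CMB temperature are assumed illustrative values that do not come from the text, and the function names are hypothetical. Because this is an equilibrium estimate, its half-ionization redshift need not coincide with the value obtained from the full rate equations.

```python
import math

K_B_EV = 8.617333e-5     # Boltzmann constant in eV/K
M_E_C2_EV = 510998.95    # electron rest energy in eV
BINDING_EV = 13.605693   # hydrogen ground-state binding energy in eV
ZETA3 = 1.2020569        # Riemann zeta(3), enters the photon number density
T_CMB_TODAY = 2.7255     # K (assumed)

def saha_ionization_fraction(z, eta=6.1e-10, helium_mass_fraction=0.24):
    """Equilibrium hydrogen ionization fraction x at redshift z (hydrogen-only Saha)."""
    kT = K_B_EV * T_CMB_TODAY * (1.0 + z)
    # r = x^2 / (1 - x); the factors of hbar*c cancel against the photon density.
    r = (math.pi ** 2 / (2.0 * ZETA3 * eta * (1.0 - helium_mass_fraction))
         * (M_E_C2_EV / (2.0 * math.pi * kT)) ** 1.5
         * math.exp(-BINDING_EV / kT))
    if r == 0.0:
        return 0.0
    # Positive root of x^2 + r x - r = 0, written to avoid cancellation at large r.
    return 2.0 / (1.0 + math.sqrt(1.0 + 4.0 / r))

def redshift_of_half_ionization(z_low=1000.0, z_high=2000.0):
    """Bisection for the redshift at which the Saha fraction equals 1/2."""
    for _ in range(60):
        z_mid = 0.5 * (z_low + z_high)
        if saha_ionization_fraction(z_mid) < 0.5:
            z_low = z_mid
        else:
            z_high = z_mid
    return 0.5 * (z_low + z_high)

if __name__ == "__main__":
    for z in (1600, 1400, 1200):
        print(z, saha_ionization_fraction(z))
    print("x = 1/2 at z =", redshift_of_half_ionization())
```

The ionization fraction drops from nearly one to a few percent over a change in temperature of only about 25 percent, which is the exponential sensitivity referred to above.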

Figure 15: Decay of gravitational potential for late-time integrated Sachs-Wolfe effect.
We sketch the time dependence of the gravitational potential for a universe with
$\Omega_{\Lambda 0}$ = 0.9, 0.8, 0.7, and 0.6 (from top to bottom on the right) with a = 1 today.
The qualitative behavior will be similar for other kinds of dark energy.

The first observational evidence for such nearly complete reionization came from observations
of the spectra of quasars at large redshift in the interval
$10.2\,\mathrm{eV}/(1 + z_{\rm quasar}) < E_\gamma < 10.2\,\mathrm{eV}$, that is, blueward of the
redshifted Lyman α line [?]. If the majority of the hydrogen along the line of sight to the
quasar were neutral, resonant Lyα (2p → 1s) scattering would deflect the photons in this
frequency range out of the line of sight, and this part of the spectrum would be devoid of
photons. The fact that the spectrum is not completely blocked in this range indicates that the
universe along the line of sight was reionized. Almost complete absorption by neutral hydrogen,
that is, the so-called Gunn-Peterson trough, was not observed in quasar spectra until 2001, when
spectroscopic follow-up of several high-redshift quasars discovered by the Sloan Digital Sky
Survey (SDSS) detected such a trough from z ≈ 5.7 to z ≈ 6.3, where the continuum emission
blueward of the Lyα line appears to have been completely absorbed by neutral hydrogen [?].

The big question here is exactly when the universe first became reionized. Or said
another way: When did the first generation of stars and quasars form and become capable of
generating enough ionizing radiation to convert the neutral gas to its present ionized state?

This is a vast subject of very current research, sometimes described as the exploration of
the “Dark Ages” of the universe. While we believe that we know a lot about the conditions
around the time when the CMB anisotropies were imprinted, and we also know a lot about
the recent universe extending to moderate redshift, little is known with any degree of certainty
concerning this intermediate epoch. Good reviews of this subject include Ref. [?]. Future
observations of the 21 cm hyperfine transition of atomic hydrogen (HI) by the next generation
of radio telescopes, in particular the Square Kilometer Array (SKA),$^o$ promise to provide
three-dimensional maps of the neutral hydrogen in our past light cone.

Here we limit ourselves to discussing the impact of reionization on the CMB anisotropies.
The most simplistic model (or caricature) of reionization postulates that the ionization
fraction x(z) changes instantaneously from x = 0 for $z > z_{\rm reion}$ to x = 1 for
$z < z_{\rm reion}$. Under this assumption, the optical depth for rescattering by the reionized
electrons is

$$\tau = \int_0^{z_{\rm reion}} dz\, \sigma_T\, n_e(z)\, \frac{d\ell}{dz}. \tag{54}$$


The Thomson scattering optical depth from redshift z to the present day is given by the
integral

$$\tau(z) = \frac{c\,\sigma_T}{H_0}\int_{a(z)}^{1} \frac{da\; n_e(a)}{a\,(\Omega_r a^{-4} + \Omega_m a^{-3} + \Omega_\Lambda)^{1/2}}
= \frac{c\,\sigma_T\, n_{e0}}{H_0}\int_{a(z)}^{1} \frac{da\; x_e(a)}{a^4\,(\Omega_r a^{-4} + \Omega_m a^{-3} + \Omega_\Lambda)^{1/2}}, \tag{55}$$

where $a(z) = 1/(z + 1)$, $\sigma_T$ is the Thomson scattering cross-section, $H_0$ is the present
value of the Hubble constant (in units of inverse length), and $n_{e0}$ is the electron density
today, assuming $x_e = 1$. The functions $n_e(a)$ and $x_e(a)$ are the electron density$^p$ and
the ionization fraction at redshift $z = 1/a - 1$, respectively.
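For the instantaneous reionization model, Eq. (55) with $x_e = 1$ for $z < z_{\rm reion}$ is straightforward to evaluate. The following sketch does so in SI units, with $H_0$ converted to inverse seconds so that the explicit factor of c makes the prefactor dimensionless. Hydrogen is assumed to be the only source of electrons, so that $n_{e0} = (1 - Y)\,\Omega_b\,\rho_{\rm crit}/m_p$; the cosmological parameter values and function names are illustrative assumptions and are not taken from the text.

```python
import math

C = 2.99792458e8           # speed of light, m/s
SIGMA_T = 6.6524587e-29    # Thomson cross-section, m^2
G_NEWTON = 6.67430e-11     # m^3 kg^-1 s^-2
M_PROTON = 1.67262192e-27  # kg
MPC_IN_M = 3.0856776e22    # metres per megaparsec

def thomson_prefactor(h0_km_s_mpc=67.7, omega_b=0.049, helium_mass_fraction=0.24):
    """Dimensionless c * sigma_T * n_e0 / H_0, with n_e0 from hydrogen only (assumption)."""
    h0_si = h0_km_s_mpc * 1.0e3 / MPC_IN_M
    rho_crit = 3.0 * h0_si ** 2 / (8.0 * math.pi * G_NEWTON)
    n_e0 = (1.0 - helium_mass_fraction) * omega_b * rho_crit / M_PROTON
    return C * SIGMA_T * n_e0 / h0_si

def optical_depth(z_reion, omega_m=0.31, omega_r=0.0, steps=2000):
    """Eq. (55) with x_e = 1 for z < z_reion and a flat universe; Simpson's rule in a."""
    omega_lambda = 1.0 - omega_m - omega_r
    a_min = 1.0 / (1.0 + z_reion)

    def integrand(a):
        return 1.0 / (a ** 4 * math.sqrt(omega_r * a ** -4 + omega_m * a ** -3 + omega_lambda))

    h = (1.0 - a_min) / steps
    total = integrand(a_min) + integrand(1.0)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * integrand(a_min + i * h)
    return thomson_prefactor() * total * h / 3.0

if __name__ == "__main__":
    for z_reion in (6.0, 10.0, 17.0):
        print(z_reion, optical_depth(z_reion))
```

When radiation is neglected the integral has the closed form $(2/3\Omega_m)\,[(\Omega_m(1+z)^3 + \Omega_\Lambda)^{1/2} - 1]$, which provides a check on the numerical quadrature and shows that $\tau$ grows roughly as $(1+z_{\rm reion})^{3/2}$.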

On small angular scales, the effect of reionization on the predicted CMB temperature
and polarization anisotropy spectra is straightforward to calculate. Of the CMB photons
emanating from the LSS, a fraction $\exp(-\tau)$ does not undergo rescattering by the reionized
electrons and their fractional temperature perturbations are preserved. However, for the
remaining fraction $(1 - \exp(-\tau)) \approx \tau$ that is rescattered, the small-angle CMB
anisotropies are completely erased. The reionized electrons nearly completely wash out
small-scale detail much like a window pane of ground glass. The net effect is that the
small-scale CMB anisotropies are attenuated, by a factor of $\exp(-\tau)$ in amplitude and by a
factor of $\exp(-2\tau)$ in the power spectrum. On larger angular scales, the attenuation of the
anisotropy from the rescattered photon component is incomplete, and a more detailed calculation
is needed to determine the detailed form of the $\ell$-dependent attenuation factor. The order of
magnitude of the angular scale where the attenuation starts to become incomplete is given by the
ratio of the co-moving distance travelled from next-to-last to last scattering by a rescattered
photon, $d_{\rm reion}$, to the radius of the LSS, $d_{LS}$. We thus obtain an angular scale
$\theta_{\rm reion} \approx d_{\rm reion}/d_{LS}$ for this transition.

$^o$ https://www.skatelescope.org/.

$^p$ It is customary to define $x_e$ as $n_e/n_{\rm H}$, so that $x_e$ slightly exceeds one when
the helium is completely ionized as well.

For determining the cosmological model from the $C^{TT}$ power spectrum alone, the
reionization optical depth is highly degenerate with the overall amplitude of the cosmological
perturbations A. This is because except on very large scales, where cosmic variance
introduces substantial uncertainty, it is the combination $A\exp(-2\tau)$ that determines the
observed amplitude of the CMB angular power spectrum.
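The degeneracy can be made explicit with a two-line calculation: any pair $(A, \tau)$ with the same value of $A\exp(-2\tau)$ yields the same small-scale temperature power. In the sketch below the numerical values and function names are purely illustrative assumptions.

```python
import math

def observed_amplitude(primordial_amplitude, tau):
    """Small-scale TT power amplitude, proportional to A * exp(-2 tau)."""
    return primordial_amplitude * math.exp(-2.0 * tau)

def degenerate_amplitude(primordial_amplitude, tau, tau_new):
    """Primordial amplitude that gives the same observed amplitude at a different tau."""
    return primordial_amplitude * math.exp(2.0 * (tau_new - tau))

if __name__ == "__main__":
    a1, tau1, tau2 = 2.0e-9, 0.05, 0.10   # assumed illustrative values
    a2 = degenerate_amplitude(a1, tau1, tau2)
    print(a2 / a1, observed_amplitude(a1, tau1), observed_amplitude(a2, tau2))
```

A shift of 0.05 in $\tau$ is thus compensated by a change of roughly ten percent in A, which is why an independent handle on $\tau$ from large-angle polarization is valuable.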

The polarization power spectra, meaning $C^{EE}$ and also $C^{TE}$, help to break this
degeneracy by providing a rather model independent measurement of $\tau_{\rm reion}$. Recall our
order of magnitude estimate for the polarization introduced by the finite thickness of the LSS

$$P \sim d^2\,\frac{\partial^2 T}{\partial x^2}, \tag{56}$$

where d and x are expressed in co-moving units and d is the mean co-moving distance that the
photon travels from next-to-last scattering to last scattering. The second derivative is that
of the temperature anisotropy evaluated at the position of a typical electron of last scatter.
Let us compare the polarization imprinted during recombination to that imprinted later for
the fraction of photons $(1 - \exp(-\tau)) \approx \tau$ rescattered by the free electrons from
reionization, for the moment assuming that the second derivative factors are comparable for the
two cases. However, $d_{\rm rec} \ll d_{\rm reion}$; therefore, we may expect that
$P_{\rm reion}/P_{\rm rec} \approx \tau\,(d_{\rm reion}/d_{\rm rec})^2 \gg 1$. This
polarization from reionization, however, is concentrated almost exclusively in the first few
multipoles, because directions separated by a small angle see the same quadrupole anisotropy
from last scattering from the vantage point of a typical free electron after reionization.
Figure 16 shows the scalar power spectra for four values of $\tau$ keeping the other
cosmological parameters fixed. We shall see that the same amplification applies to the tensor
modes, or primordial gravitational waves, presumably generated during inflation.

We now turn to observations allowing us to fix the value of $\tau$ using the low-$\ell$ $C^{EE}$ spectra
and $C^{TE}$ spectra, based on the qualitative theoretical arguments above but made more precise
by calculating $\tau$ within the framework of the previously described six-parameter concordance
model. While the first observations of the polarization of the CMB were reported by the DASI
experiment [110], it was the WMAP large-angle polarization results, first the TE correlations
reported in the one-year release, later followed by the EE correlation results from the
third-year results, that provided the first determination of $\tau_{reion}$ using the CMB, subsequently
refined in later updates to the WMAP results with more integration time. As part of their
first-year results [2¥0], the WMAP team reported a value for the reionization optical depth
of $\tau = 0.17 \pm 0.04$ at 68% confidence level, which would correspond to $11 < z_r < 30$ at
95% confidence where a step function profile for the ionization fraction is assumed. This
determination was based on exploiting only the measurement of the TE cross-correlation
power spectrum, because for this first release the analysis of the polarized sky maps was not
sufficiently advanced to include a reliable EE auto-correlation power spectrum measurement.
The reported value of $\tau$ was larger than expected. In their third-year release [246], when
WMAP presented a full analysis of the polarization with EE included, the determination of
$\tau$ shifted downward, namely to $\tau = 0.10 \pm 0.03$ (using only EE data) and $\tau = 0.09 \pm 0.03$
(with TT, TE and EE all included). The final nine-year WMAP release [260] cites a value
of $\tau \approx 0.089 \pm 0.014$ using WMAP alone. Including other external data sets results in small
shifts about this value.

The Planck 2013 Results for Cosmology release [165, 170, 175] did not include an analysis of the
Planck polarization data. Therefore the likelihoods used to determine the cosmological parameters
included the WMAP determination of $\tau$ as a prior, and in some cases the WMAP low-$\ell$ polarization
likelihood was used in order to break the degeneracy between $\tau$ and $A$ described above that
arises when only temperature data is used.

It has been pointed out that reionization histories, which are completely characterized by the
function $x_e(z)$, cannot be reduced to a single number. Some work has been done to characterize
what further information can be extracted from the low-$\ell$ CMB concerning the reionization
history, as described in Refs. [93] and m, but the conclusion is that from the CMB data at most
a few nonzero numbers can be extracted at $S/N \gtrsim 1$ given the large cosmic variance at low $\ell$.

In the above discussion we have assumed that reionization occurs homogeneously in space,
an approximation adequate for calculating its effect on the anisotropies at large and
intermediate angular scales. However, according to our best understanding, the universe becomes
reionized by the formation of bubbles, or Strömgren spheres, surrounding the first sources of
ionizing radiation, which grow and become more numerous, and eventually percolate. While
this basic picture is highly plausible, the details of modeling exactly how this happens remain
speculative. The effect on the CMB is significant only at very small angular scales, far into
the damping tail of the primordial anisotropies. Power from the low-$\ell$ anisotropies is
transformed into power at very high $\ell$. This can be understood by considering the power in Fourier
space resulting from the rather sharp bubble walls, which in projection look almost like jump
discontinuities. Some simulations of this effect, along with other effects at large $\ell$, can be
found for example in Ref. [262].

Figure 16: CMB power spectra for four values of the reionization optical depth. We plot
the indicated power spectra for $\tau$ = 0.00, 0.04, 0.08 and 0.12, assuming the same concordance
values for the other cosmological parameters. We multiply the spectra by $\exp(2\tau)$ in order to
remove the trivial dependence on $\tau$ at small angular scales and thus highlight what occurs at
low $\ell$.
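The two simple $\tau$-dependent factors that appear in this section can be tabulated directly. The following is only a minimal sketch: the function names are illustrative and not taken from any CMB code, and the only inputs are the four optical depths quoted in the caption of Figure 16. It evaluates the rescattered fraction $1 - \exp(-\tau)$ and the small-scale suppression $\exp(-2\tau)$ that the caption removes by multiplying the spectra by $\exp(2\tau)$.

```python
import math

def rescattered_fraction(tau):
    """Fraction of CMB photons rescattered after reionization: 1 - exp(-tau)."""
    return 1.0 - math.exp(-tau)

def small_scale_suppression(tau):
    """Suppression of the power spectrum at small angular scales: exp(-2 tau)."""
    return math.exp(-2.0 * tau)

# The four optical depths quoted in the caption of Figure 16.
taus = [0.00, 0.04, 0.08, 0.12]
for tau in taus:
    frac = rescattered_fraction(tau)
    supp = small_scale_suppression(tau)
    print(f"tau={tau:.2f}  rescattered fraction={frac:.4f}  power suppression={supp:.4f}")
```

For these small optical depths the rescattered fraction stays within a few percent of $\tau$ itself, which is the approximation used in the text.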

7.6 What we have not mentioned 

The preceding subsections provide a sampling of what can be learned from the CMB TT and 
polarization power spectra with each point discussed in some detail. For lack of space, we 
cannot cover all the parameters and extensions of the simple six-parameter concordance model 
that can be explored, and here we provide a few words regarding what we have not covered 
giving some general references for more details. 

The structure of the acoustic oscillations and the damping tail (relative heights, positions,
etc.) of the CMB power spectra depends on the cosmological parameters, as forecast and
explained pedagogically in Refs. [90, 97], and most recently applied to the Planck 2013 data
for determining the cosmological parameters cm. Some of the constraints are obtained in
the framework of the standard six-parameter concordance model, but many extensions to
this model can be constrained, such as theories with varying $\alpha_{QED}$, nonminimal numbers of
neutrinos, varying neutrino masses, isocurvature modes, nonstandard recombination, to name
just a few examples. Moreover the question can be addressed whether there exists statistically
significant evidence in favor of extending the basic six-parameter model to a model having
additional parameters. See Refs. B2D] and [175] for an extensive set of references.


8 Gravitational Lensing of the CMB 

The previous sections presented a simplified view of the CMB where the CMB anisotropies 
are imprinted on the so-called last scattering surface, situated around z ~ 1100. It is assumed 
that we look back to this surface along straight line geodesics in the unperturbed coordinate 
system. This is not a bad approximation, but it is not the whole story. Inhomogeneities 
in the distribution of matter, particularly at late times when the clustering of matter has 
become nonlinear, act to curve the photon, or geometric optics, trajectories. These nonuniform 
deflections distort the appearance of the surface of last scatter, much like a fun house mirror. 
Gravitational lensing of the CMB is a small effect but large enough so that it has to be taken 
into account in order to compare models of the primordial universe with the observations 
correctly. In one sense, gravitational lensing may be viewed as a nuisance effect to be removed. 
But gravitational lensing of the CMB may also be regarded as an invaluable tool for probing 
the inhomogeneities of the matter distribution between us and the last scattering surface. 
Gravitational lensing is presently a very active area of astronomy, and the CMB is only one of 
the many types of objects whose gravitational lensing may be measured in order to probe the 
underlying mass inhomogeneities in the universe. Gravitational lensing of the CMB may be 
contrasted with competing weak lensing probes in that: (i) for the CMB, the sources (i.e., the 
objects being lensed) are the most distant possible, situated at z ~ 1100, (ii) because of the 
above, CMB lensing is sensitive to clustering at larger redshifts, thus less sensitive to nonlinear 
corrections, and (iii) unlike observations of the correlations of the observed ellipticities of 
galaxies, where there are systematic errors due to intrinsic alignments, for CMB lensing there 
is no such problem.

Figure 17: Power spectrum of CMB gravitational lensing potential as measured by
Planck. The angular power spectrum for the lensing potential as reported in the Planck 2013
results is shown above. The overall statistical significance for a lensing detection is greater
than 25$\sigma$. The solid curve indicates the expectation based on the concordance model whose
six parameters have been fixed by the Planck data combined with ancillary data. For more
details, see Ref. [171]. (Credit: ESA/Planck Collaboration)


In the linear theory (which is a good approximation, especially for a heuristic discussion),
the deflection due to lensing may be described by means of a lensing potential, defined as
a function of position on the celestial sphere. Let $\hat\Omega_{unlens}$ coordinatize the last scattering
surface as it would look in the absence of lensing, and define $\hat\Omega_{lens}$ to be the position on the
last scattering surface with lensing taken into account. We may express

$$\hat\Omega_{lens} = \hat\Omega_{unlens} + \nabla \Phi_{lens}(\hat\Omega). \tag{57}$$

It follows that

$$T_{lens} \approx T_{unlens} + (\nabla \Phi_{lens}) \cdot (\nabla T_{unlens}), \tag{58}$$

and similar expressions may be derived for the polarization. As in weak lensing, we may derive
a projected Poisson equation for the lensing potential

$$\nabla^2 \Phi_{lens}(\hat\Omega) = \int_0^{x_{source}} dx\, W(x; x_{source})\, \delta\rho(\hat\Omega, x), \tag{59}$$

where $\nabla^2$ is the Laplacian operator on the sphere. More details about the lensing of the CMB
and what can be learned from it may be found in the review m and references therein.
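The relation between the remapping of Eq. (57) and the first-order expansion of Eq. (58) can be checked numerically on a toy problem. The sketch below is an illustration under stated assumptions, not a lensing pipeline: it uses a one-dimensional periodic "sky", a made-up temperature pattern, and a made-up lensing potential with a small amplitude. All function names are hypothetical.

```python
import numpy as np

# Toy one-dimensional periodic "sky" with coordinate x in [0, 2*pi).
x = np.linspace(0.0, 2.0 * np.pi, 512, endpoint=False)

def T_unlens(x):
    """Hypothetical unlensed temperature pattern (arbitrary units)."""
    return np.sin(5.0 * x) + 0.5 * np.cos(11.0 * x)

def grad_T_unlens(x):
    """Analytic derivative of T_unlens."""
    return 5.0 * np.cos(5.0 * x) - 5.5 * np.sin(11.0 * x)

def deflection(x, amplitude=1e-3):
    """Gradient of a hypothetical lensing potential Phi = -amplitude * cos(x)."""
    return amplitude * np.sin(x)

alpha = deflection(x)

# Exact remapping: evaluate the unlensed pattern at the deflected position.
T_exact = T_unlens(x + alpha)

# First-order expansion in the deflection.
T_first_order = T_unlens(x) + alpha * grad_T_unlens(x)

max_error = np.max(np.abs(T_exact - T_first_order))
max_shift = np.max(np.abs(T_exact - T_unlens(x)))
print(f"lensing-induced change: {max_shift:.2e}, error of first-order formula: {max_error:.2e}")
```

The residual is second order in the deflection, so the linear expansion is adequate as long as the deflection is small compared with the scale on which the unlensed pattern varies.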

The first discovery of gravitational lensing of the CMB involved detecting a statistically
significant nonzero cross-correlation between a lensing map reconstructed using the WMAP
data and infrared galaxies [209]. The use of a cross-correlation allows one to detect a small
signal within a noisy map. Figure 17 shows the gravitational lensing spectrum as measured
by Planck [171]. Measurements of gravitational lensing have also been made by ACT [39]
and SPT [223], and the Polarbear experiment has recently reported the discovery of B modes
due to lensing [186]. The lensing field described above (which can be expressed either as a
potential, as a deflection field, or as a dilatation and shear field) can in principle be recovered
from distortions in the small-scale power spectrum due to the lensing field on larger scales. As
a general rule, the higher the resolution of the survey, the better the reconstruction. Hu and
Okamoto [91, 146] proposed a reconstruction based on Fourier modes, and later many other
estimators have been proposed, which are all quite similar because they all exploit the
same signal. In the future, as observations at higher sensitivity, and in particular of B
modes at small scales, become available, lensing promises to become a powerful probe of the
clustering of matter and halo structure. It is hoped that lensing will eventually be able to
determine the absolute neutrino masses, even if they are as low as allowed by present neutrino
oscillation data mu. Lensing detected through B modes is particularly advantageous because
the fact that the B modes should be zero in the absence of lensing means that cosmic variance
does not intervene and that only the instrument noise is an issue.$^q$


9 CMB Statistics 


9.1 Gaussianity, non-Gaussianity, and all that 


For the simplest models analyzed at linear order, the primordial CMB signal is the outcome of
an isotropic Gaussian random process on the celestial sphere. While any particular realization
of this random process produces a sky temperature map that is not isotropic, the hypothesis
is that the underlying stochastic process is isotropic. Concretely the hypotheses of isotropy
and Gaussianity imply that the probability of obtaining a sky map $T(\hat\Omega)$, whose multipole
expansion in terms of spherical harmonic coefficients is given by

$$T(\hat\Omega) = \sum_{\ell=0}^{\infty} \sum_{m=-\ell}^{+\ell} a_{\ell m}\, Y_{\ell m}(\hat\Omega), \tag{60}$$

is as follows:

$$P(\{a_{\ell m}\}) = \prod_{\ell=0}^{\infty} \prod_{m=-\ell}^{+\ell} \frac{1}{\sqrt{2\pi C_\ell^{th}}} \exp\left[ -\frac{1}{2} \frac{|a_{\ell m}|^2}{C_\ell^{th}} \right]. \tag{61}$$

Here $\hat\Omega = (\theta, \phi)$ denotes a position on the celestial sphere and the set of positive coefficients
$\{C_\ell^{th}\}$ represents the angular power spectrum of the underlying stochastic process, or the
theoretical power spectrum.

If we had postulated Gaussianity alone without the additional hypothesis of isotropy, we
would instead have obtained a more general probability distribution function having the form

$$P(\{a_{\ell m}\}) = \det{}^{-1/2}(2\pi C_{th}) \exp\left[ -\frac{1}{2} \sum_{\ell, m} \sum_{\ell', m'} a_{\ell m}^{*}\, (C_{th}^{-1})_{\ell m, \ell' m'}\, a_{\ell' m'} \right]. \tag{62}$$

Here $C_{th}$ is the covariance matrix for the spherical harmonic multipole coefficients. This Ansatz
is too general to be useful because there are many more parameters than observables, and in
cosmology we can observe only a single sky or, said another way, only a single realization of
the stochastic process defined in Eq. (62). The assumption of isotropy is very strong, greatly
restricting the number of degrees of freedom by setting all the off-diagonal elements to zero
and requiring that the diagonal elements depend only on $\ell$.

$^q$ This conclusion hardly depends on $r$, whatever its final value may turn out to be, because the
angular scales best for probing lensing lie beyond the recombination bump of the B modes.

Happily, the present data, subject to some caveats (such as gravitational lensing, discussed
above in Sec. 8), seem consistent with the hypotheses of isotropy and Gaussianity,
notwithstanding some “anomalies” at moderate statistical significance (discussed in detail below in
Sec. 12). These anomalies could be a sign of something new, but given their limited statistical
significance, the argument that they are simply statistical “flukes” cannot be rejected.

For completeness, we now indicate how the above formalism is extended to include the
polarization of the primordial CMB signal. There are several formalisms for describing the
polarization of the CMB, for example, using spin-weighted spherical harmonics, or using
the Stokes parameters I, Q, U, V [23]. (The polarization of the CMB and its description
are discussed in Refs. [35, 96, 108].) We may define
$T_{ab}(\hat\Omega) = \langle \mathcal{E}_a(\hat\Omega)^{*}\, \mathcal{E}_b(\hat\Omega) \rangle_{time}$, where
$\mathcal{E}_a(\hat\Omega)$ is the electromagnetic field component propagating from the direction $\hat\Omega$ and units are
chosen so that, for a unit electric polarization vector $\hat\epsilon$, $T_{ab}(\hat\Omega)\, \epsilon_a \epsilon_b$ represents the CMB
thermodynamic temperature when by means of a linear polarizer only the component along
$\hat\epsilon$ is measured. We may decompose

$$T_{ab}(\hat\Omega) = T(\hat\Omega)\, \delta_{ab} + P_{ab}(\hat\Omega), \tag{63}$$

where the polarization tensor (containing the Stokes parameters Q, U and V) is trace-free and
Hermitian (and also symmetric in the absence of circular polarization).

We must now choose a convenient basis for the polarization. If we considered a single point
on the celestial sphere, we could, using Stokes parameters, set $Q = T_{11} - T_{22}$ and $U = 2T_{12}$, but
no natural choice for the directions of the orthonormal unit vectors $\hat e_1$ and $\hat e_2$ in the tangent space
at $\hat\Omega$ can be singled out as preferred. Had we instead chosen

$$\begin{pmatrix} \hat e_1' \\ \hat e_2' \end{pmatrix} = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \begin{pmatrix} \hat e_1 \\ \hat e_2 \end{pmatrix}, \tag{64}$$

we would instead have the Stokes parameters

$$\begin{pmatrix} Q' \\ U' \end{pmatrix} = \begin{pmatrix} \cos 2\theta & \sin 2\theta \\ -\sin 2\theta & \cos 2\theta \end{pmatrix} \begin{pmatrix} Q \\ U \end{pmatrix}. \tag{65}$$

The transformation law in Eq. (65) indicates that the traceless part of the polarization is of
spin-2. If the sky were flat with the geometry of $R^2$ instead of $S^2$ and we had to include
homogeneous modes as well, there would be no way out of this quandary. But there is a
celebrated result of differential geometry that fur on a sphere of even dimension cannot be
combed without somewhere leaving a bald spot or singularity.$^r$ A traceless tensor field on the
sphere cannot have an everywhere vanishing covariant derivative.
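The spin-2 transformation law of Eq. (65) can be verified numerically by rotating the tangent-space basis and recomputing $Q = T_{11} - T_{22}$ and $U = 2T_{12}$. The following is a minimal sketch: the numerical values of the temperature and of Q and U are arbitrary illustrative choices, the function names are hypothetical, and the sign convention assumed for the basis rotation is stated in the docstring.

```python
import numpy as np

def stokes_from_tensor(T):
    """Q = T_11 - T_22 and U = 2 T_12 for a real symmetric 2x2 tensor T_ab."""
    return T[0, 0] - T[1, 1], 2.0 * T[0, 1]

def rotate_basis(T, theta):
    """Components of T_ab in the rotated basis
    e1' = cos(theta) e1 + sin(theta) e2, e2' = -sin(theta) e1 + cos(theta) e2."""
    R = np.array([[np.cos(theta), np.sin(theta)],
                  [-np.sin(theta), np.cos(theta)]])
    return R @ T @ R.T

# Illustrative tensor: isotropic part plus a trace-free symmetric part built from Q and U.
T0, Q, U = 2.725, 3.0e-6, -1.0e-6
T = T0 * np.eye(2) + 0.5 * np.array([[Q, U], [U, -Q]])

theta = 0.3
Q_rot, U_rot = stokes_from_tensor(rotate_basis(T, theta))

# Spin-2 prediction: (Q, U) rotates by the angle 2*theta.
Q_pred = np.cos(2 * theta) * Q + np.sin(2 * theta) * U
U_pred = -np.sin(2 * theta) * Q + np.cos(2 * theta) * U

print("rotated basis:", Q_rot, U_rot)
print("spin-2 law   :", Q_pred, U_pred)
```

The combination $Q^2 + U^2$ is unchanged by the rotation, while the isotropic part of the tensor does not contribute to either Stokes parameter.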


$^r$ This is a consequence of the “hairy ball” theorem m.

We can generate a complete basis for the polarization on the celestial sphere by means of
derivative operators acting on the spherical harmonics. Using $Y_{\ell m}(\hat\Omega)$ as a starting point, we
may define an E mode basis vector as follows:

$$E_{ab,\ell m}(\hat\Omega) = N_\ell^{E,B} \left[ \nabla_a \nabla_b - \frac{1}{2} \delta_{ab} \nabla^2 \right] Y_{\ell m}(\hat\Omega), \tag{66}$$

where $N_\ell^{E,B}$ is a normalization factor. Here $a, b = 1, 2$ indicate components with respect to an
arbitrary orthonormal basis on $S^2$. Without introducing more structure, this is the only way
to produce a second-rank traceless tensor on the sphere with derivative operators acting on
$Y_{\ell m}(\hat\Omega)$ in a way that does not break isotropy, a feature that ensures that $E_{ab,\ell m}(\hat\Omega)$ transforms
under rotations according to the quantum numbers $\ell m$. However, there is another polarization,
rotated with respect to the E mode by 45°, and in order to write down a basis for this other
polarization, known as the B modes, it is necessary to introduce an orientation on the sphere
through the unit antisymmetric tensor $\epsilon_{ab}$, normalized so that $\epsilon_{12} = 1$. This choice of
orientation, or volume element in the language of differential geometry, defines a handedness, or
direction in which the 45° rotation is made.

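We may define

$$B_{ab,\ell m} = \frac{1}{2} (\delta_{aa'} + \epsilon_{aa'}) (\delta_{bb'} + \epsilon_{bb'})\, E_{a'b',\ell m}, \tag{67}$$

or in terms of derivative operators

$$B_{ab,\ell m} = \frac{1}{2} N_\ell^{E,B} (\epsilon_{ac} \delta_{bd} + \epsilon_{bc} \delta_{ad}) \nabla_c \nabla_d Y_{\ell m}(\hat\Omega). \tag{68}$$

The difference between E modes and B modes is illustrated in Fig. 18. We want to normalize
so that

$$\int d\Omega\, E_{ab,\ell m}(\hat\Omega)\, E^{*}_{ab,\ell' m'}(\hat\Omega) = \int d\Omega\, B_{ab,\ell m}(\hat\Omega)\, B^{*}_{ab,\ell' m'}(\hat\Omega) = \delta_{\ell \ell'} \delta_{m m'} \tag{69}$$

and

$$\int d\Omega\, E_{ab,\ell m}(\hat\Omega)\, B^{*}_{ab,\ell' m'}(\hat\Omega) = 0, \tag{70}$$

so we require that

$$N_\ell^{E,B} = \sqrt{\frac{2}{(\ell + 2)(\ell + 1)\ell(\ell - 1)}}. \tag{71}$$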
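The statement that B modes are E modes rotated by 45° can be illustrated in the flat-sky limit, where the spherical harmonics are replaced by plane waves. The sketch below is an illustration under stated assumptions rather than the construction on the sphere used above: it works on a small periodic patch, builds Q and U from a made-up scalar potential by spectral differentiation, and uses the common flat-sky convention that E and B are obtained from the Fourier coefficients of Q and U by a rotation through twice the wavevector angle. All names are hypothetical.

```python
import numpy as np

n = 64
x = np.arange(n) * 2.0 * np.pi / n                 # periodic patch of side 2*pi
X, Y = np.meshgrid(x, x, indexing="ij")

# Hypothetical smooth periodic scalar potential.
psi = np.cos(3 * X + 2 * Y) + 0.5 * np.sin(X - 4 * Y)

# Integer wavenumbers on the patch and the direction angle of each Fourier mode.
k = np.fft.fftfreq(n, d=1.0 / n)
KX, KY = np.meshgrid(k, k, indexing="ij")
phi_k = np.arctan2(KY, KX)

def stokes_from_potential(psi):
    """Q = (d_x^2 - d_y^2) psi and U = 2 d_x d_y psi, using spectral derivatives."""
    psi_k = np.fft.fft2(psi)
    Q = np.fft.ifft2(-(KX**2 - KY**2) * psi_k).real
    U = np.fft.ifft2(-2.0 * KX * KY * psi_k).real
    return Q, U

def eb_decompose(Q, U):
    """Flat-sky E and B Fourier coefficients from maps of Q and U."""
    Qk, Uk = np.fft.fft2(Q), np.fft.fft2(U)
    Ek = np.cos(2 * phi_k) * Qk + np.sin(2 * phi_k) * Uk
    Bk = -np.sin(2 * phi_k) * Qk + np.cos(2 * phi_k) * Uk
    return Ek, Bk

def power(field_k):
    """Total power in a set of Fourier coefficients."""
    return float(np.sum(np.abs(field_k) ** 2))

Q, U = stokes_from_potential(psi)
E1, B1 = eb_decompose(Q, U)        # trace-free double derivative of a scalar potential
E2, B2 = eb_decompose(-U, Q)       # same map with every polarization direction rotated by 45 degrees

print("potential-derived map: E power =", power(E1), " B power =", power(B1))
print("45-degree rotated map: E power =", power(E2), " B power =", power(B2))
```

On this periodic patch the first map has B power only at the level of rounding error, and the rotated map has the same total power carried entirely by B, which is the flat-sky analogue of Eqs. (66) to (68).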


Although the above discussion emphasized the role of spherical harmonics, the distinction
between E and B modes does not rely on spherical harmonics, and much confusion has
resulted from not fully appreciating this point. The absence of an E mode component or a B
mode component may be formulated in a completely local manner using covariant derivative
operators acting on the polarization tensor $P_{ab}(\hat\Omega)$. If $\nabla_a P_{ab} = 0$, then the E mode is absent,
and similarly if $\epsilon_{ab} \nabla_a P_{bc} = 0$, then the B mode is absent. Consequently, if we are willing to
differentiate the data, we could detect B modes locally with absolutely no leakage. However,
the above does not imply that a unique decomposition of $P_{ab}$ into E and B on an incomplete
sky is possible, because nonsingular vector fields (akin to harmonic functions) can be found
that satisfy both conditions, although no nontrivial such vector fields exist defined on the full
sky. Where global topology does enter is in eliminating configurations that satisfy both the
above differential constraints.

Figure 18: E and B modes of the CMB polarization. The polarization pattern in the left
panel can be expressed as the double derivative acting on a potential with its trace removed
(in this case a Gaussian potential) and thus is an E mode. The B mode pattern on the
right, however, cannot be represented in this way, but can be represented by a pseudo-scalar
potential whose double derivative with the trace removed followed by a 45° rotation gives the
indicated pattern.

Having defined the E and B modes, we now indicate how the statistical distribution defined
in Eq. (61) is modified to include polarization. We modify our notation somewhat to avoid
unnecessary clutter. When polarization is included, our vector $\{a_{\ell m}\}$ is generalized to become
$\{a^T_{\ell m}, a^E_{\ell m}, a^B_{\ell m}\}$, which we shall denote as the vector $(t, e, b)$. For each $\ell$ we have a positive
definite covariance matrix of the form

$$C_\ell = \begin{pmatrix} C_\ell^{TT} & C_\ell^{TE} & C_\ell^{TB} \\ C_\ell^{ET} & C_\ell^{EE} & C_\ell^{EB} \\ C_\ell^{BT} & C_\ell^{BE} & C_\ell^{BB} \end{pmatrix}. \tag{72}$$

If we assume that the underlying physics imprinting the cosmological perturbations is invariant
under spatial inversion, then the above covariance matrix simplifies to

$$C_\ell = \begin{pmatrix} C_\ell^{TT} & C_\ell^{TE} & 0 \\ C_\ell^{ET} & C_\ell^{EE} & 0 \\ 0 & 0 & C_\ell^{BB} \end{pmatrix} \tag{73}$$

because under spatial inversion

$$C_\ell^{TB} \to -C_\ell^{TB}, \qquad C_\ell^{EB} \to -C_\ell^{EB}, \tag{74}$$

whereas the other power spectra coefficients preserve their sign.

If parity is a symmetry of the physical processes generating the primordial perturbations
and imprinting the CMB anisotropies, the expectation values $C_\ell^{TB,th}$ and $C_\ell^{EB,th}$ must vanish,
although for any particular sky realization $C_\ell^{TB,obs}$ and $C_\ell^{EB,obs}$ will not vanish because of
cosmic variance. (Recall that $C_\ell^{TB,th}$ and $C_\ell^{EB,th}$ are averages over a fictitious ensemble
including an infinite number of sky realizations.) Studying $C_\ell^{TB,obs}$ and $C_\ell^{EB,obs}$ to search for
statistically significant deviations from zero is a way to search for parity violation in the very
early universe. However, in practice adjusting experimental parameters to minimize $C_\ell^{TB,obs}$
and $C_\ell^{EB,obs}$ is often used to calibrate detector angles in B mode polarization experiments.

The propagation of light, or electromagnetic radiation, at least historically and
conceptually, occupies a privileged place in the theories of special and general relativity, because of
the constancy of the speed of light in vacuum and the fact that, at least in the eikonal
approximation, light travels along null geodesics, with its electric polarization vector propagated
along by parallel transport. It is by thinking of bundles of light rays that we construct for
ourselves a physical picture of the causal structure of spacetime. A modification of the
propagation of light (for example, from dilaton and axion couplings [144], a possible birefringence of
spacetime [63j, 12121101 . 32], or alternative theories of electromagnetism [71, EJ 1145]) would
not necessarily undermine the foundations of relativity theory, but would probably merely
lead to a more complicated theory, possibly only in the electromagnetic sector. The CMB
may be considered an extreme environment because of the exceptionally long travel time of
the photons, and one can search for new effects beyond the known interactions with matter
(e.g., Thomson and other scattering, collective plasma effects such as dispersion and Faraday
rotation). Some such effects would lead to spectral distortions, for example through missing
photons. A rotation of the polarization vector during this long journey would lead to mixing
of E and B modes and, since the primordial E modes are so much larger, this leads to interesting
constraints on birefringence, which will greatly improve as more data comes in from searches
for primordial B modes.

The statistical description in Eqs. (72) and (73) suffices for confronting the predictions of
theory with idealized measurements of the microwave sky with complete sky coverage and
no instrument noise. We must further assume that secondary anisotropies and galactic
foregrounds are either absent or have been perfectly removed.

Even under these idealized assumptions, it is not possible to pin down the theory
completely because of a phenomenon known as cosmic variance. The problem is that while we
would like to characterize the covariance of the underlying theory $C_\ell^{AB,th}$ (where $A, B =
T, E, B$), we have only a single realization of the microwave sky, and $C_\ell^{AB,obs}$ is only an
estimate of $C_\ell^{AB,th}$ with fluctuations about its expectation value, which is equal to $C_\ell^{AB,th}$. If
we consider just the temperature fluctuations, we may define the sample variance for the $\ell$th
multipole

$$C_\ell^{TT,obs} = \frac{1}{2\ell + 1} \sum_{m=-\ell}^{+\ell} |a_{\ell m}|^2. \tag{75}$$

The random variable $C_\ell^{TT,obs}$ obeys a $\chi^2$-distribution with $(2\ell + 1)$ degrees of freedom and thus
has a fractional standard deviation of $\sqrt{2/(2\ell + 1)}$. If we consider a microwave sky map bandlimited up to
$\ell_{max}$, there are approximately $\ell_{max}^2$ degrees of freedom, so for example the overall amplitude of
the cosmological perturbations may be determined with a fractional accuracy of approximately
$1/\ell_{max}$.
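The size of the cosmic variance quoted above can be checked with a short Monte Carlo experiment. This is a minimal sketch under a simplifying assumption: the $2\ell + 1$ coefficients of each multipole are drawn as independent real Gaussians of variance $C_\ell^{th}$ (a real-basis convention chosen for simplicity), and the function name and the number of simulated skies are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_C_obs(ell, C_th, n_skies, rng):
    """Draw n_skies realizations of C_ell^obs = sum_m |a_lm|^2 / (2 ell + 1).

    The 2*ell + 1 coefficients are independent real Gaussians of variance C_th
    (real-basis convention, assumed here for simplicity).
    """
    a_lm = rng.normal(0.0, np.sqrt(C_th), size=(n_skies, 2 * ell + 1))
    return np.mean(a_lm**2, axis=1)

C_th = 1.0
results = {}
for ell in (2, 10, 100):
    C_obs = sample_C_obs(ell, C_th, n_skies=20000, rng=rng)
    measured = np.std(C_obs) / C_th
    predicted = np.sqrt(2.0 / (2 * ell + 1))
    results[ell] = (measured, predicted)
    print(f"ell={ell:4d}  fractional scatter: measured={measured:.4f}  predicted={predicted:.4f}")
```

The scatter among simulated skies follows $\sqrt{2/(2\ell + 1)}$ closely, which makes explicit why the lowest multipoles can never be measured precisely from our single sky.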

Most analyses of the CMB data (possibly combined with other data sets) assume a model
where the theoretical power spectrum depends on a number of cosmological parameters, which
we may abstractly denote as $\alpha_1, \alpha_2, \ldots, \alpha_D$, also written more compactly as the vector $\alpha$. For
example, in the six-parameter minimal cosmological model claimed sufficient in the 2013
Planck Analysis [170] to explain the present data, the parameters comprising this vector were
$A_s$, $n_s$, $H_0$, $\omega_b$, $\omega_{CDM}$ and $\tau$.

Whether one adopts a frequentist or Bayesian analysis to analyze the data, the input is
always the relative likelihood of the competing models given the observations, or a ratio of
the form:

$$\frac{P(\text{observations} \mid \alpha_1)}{P(\text{observations} \mid \alpha_2)}. \tag{76}$$

In this ratio, the measure for the probability density of the outcome, represented by a
continuous variable, cancels.

In this abridged discussion, we shall follow a Bayesian analysis where a prior probability
$P_{prior}(\alpha)$ is assumed on the space of models. In Bayesian statistics, Bayes’ theorem is used to
tell us how we should rationally update our prior beliefs (or prejudices) in light of the data or
observations, to yield a posterior probability to characterize our updated beliefs given by the
formula

$$P_{posterior}(\alpha) = \frac{P(\text{observations} \mid \alpha)\, P_{prior}(\alpha)}{\int d^D\alpha'\, P(\text{observations} \mid \alpha')\, P_{prior}(\alpha')}. \tag{77}$$


There is no reason why the posterior distribution should lend itself to a simple analytic form, 
and a common procedure is to explore the form of the posterior distribution using Markov 
Chain Monte Carlo (MCMC) methods [123, 37], which are particularly well suited to explore 
distributions of large dimensionality. 
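The idea behind MCMC exploration of a posterior can be sketched with the simplest such algorithm, a random-walk Metropolis sampler. The example below is a toy illustration and not the sampler used in any particular CMB analysis: the two-parameter Gaussian "posterior", its mean and covariance, the step size, and the chain length are all made-up values. Note that only ratios of posterior values enter, so the normalizing integral in Eq. (77) is never needed.

```python
import numpy as np

# Hypothetical two-parameter posterior: a correlated Gaussian (illustrative values).
TRUE_MEAN = np.array([1.0, -0.5])
TRUE_COV = np.array([[1.0, 0.6], [0.6, 2.0]])
COV_INV = np.linalg.inv(TRUE_COV)

def log_posterior(alpha):
    """Unnormalized log-posterior of the toy model."""
    d = alpha - TRUE_MEAN
    return -0.5 * d @ COV_INV @ d

def metropolis(log_prob, start, n_steps, step_size, rng):
    """Random-walk Metropolis sampler with an isotropic Gaussian proposal."""
    chain = np.empty((n_steps, len(start)))
    current = np.array(start, dtype=float)
    current_lp = log_prob(current)
    for i in range(n_steps):
        proposal = current + step_size * rng.normal(size=current.shape)
        proposal_lp = log_prob(proposal)
        # Accept with probability min(1, P(proposal) / P(current)).
        if np.log(rng.uniform()) < proposal_lp - current_lp:
            current, current_lp = proposal, proposal_lp
        chain[i] = current
    return chain

rng = np.random.default_rng(1)
chain = metropolis(log_posterior, start=[0.0, 0.0], n_steps=40000, step_size=1.0, rng=rng)
samples = chain[5000:]                      # discard an initial burn-in segment
est_mean = samples.mean(axis=0)
est_cov = np.cov(samples.T)
print("estimated posterior mean:", est_mean)
print("estimated posterior covariance:\n", est_cov)
```

In realistic applications the log-posterior would be the CMB likelihood evaluated for a set of cosmological parameters plus the log-prior, and convergence diagnostics would be needed before trusting the chain.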

However, it often occurs that the likelihood and the posterior distribution can be
adequately represented as a Gaussian by expanding about its maximum likelihood value. In this
case, the maximum of the log of the probability can be found using a numerical
optimization routine, and then by computing second derivatives using numerical finite differences, the
Gaussian approximation to the posterior can be found. This infinitesimal analysis is often
referred to as a Fisher analysis.
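A one-parameter example shows how such a Gaussian approximation works in practice. The sketch below is a toy calculation under stated assumptions: the only free parameter is an overall amplitude $A$ multiplying a fixed spectrum (set to unity), the simulated data use real Gaussian coefficients with $2\ell + 1$ degrees of freedom per multipole, and the maximum is taken from the closed-form solution available for this toy problem rather than from a numerical optimizer.

```python
import numpy as np

ell_max = 200
ells = np.arange(2, ell_max + 1)
rng = np.random.default_rng(2)

# Simulated C_ell^obs for a true amplitude A = 1 with a flat fiducial spectrum.
C_obs = np.array([np.mean(rng.normal(size=2 * l + 1) ** 2) for l in ells])

def neg_log_like(A):
    """-ln L for Gaussian a_lm with C_ell^th = A, up to an A-independent constant."""
    return 0.5 * np.sum((2 * ells + 1) * (C_obs / A + np.log(A)))

# Maximum-likelihood amplitude (closed form for this toy problem).
n_modes = np.sum(2 * ells + 1)
A_hat = np.sum((2 * ells + 1) * C_obs) / n_modes

# Curvature of -ln L at the maximum from a central finite difference.
h = 1e-3 * A_hat
fisher = (neg_log_like(A_hat + h) - 2.0 * neg_log_like(A_hat) + neg_log_like(A_hat - h)) / h**2
sigma_A = 1.0 / np.sqrt(fisher)

print(f"A_hat = {A_hat:.4f}, fractional error = {sigma_A / A_hat:.5f}")
print(f"sqrt(2 / number of modes) = {np.sqrt(2.0 / n_modes):.5f}, 1/ell_max = {1.0 / ell_max:.5f}")
```

The fractional error agrees with $\sqrt{2/N_{modes}}$, where $N_{modes} \approx \ell_{max}^2$, which is the origin of the estimate that the overall amplitude can be determined to a fractional accuracy of order $1/\ell_{max}$.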


9.2 Non-Gaussian alternatives 

It is incredibly difficult to test the Gaussianity of the primordial microwave sky without some 
guidance from theory as to what non-Gaussian alternatives are well motivated. The space 
of non-Gaussian stochastic models is dauntingly vast, with the space of Gaussian models 
occupying by any measure an almost negligible fraction of the model space. If one tests enough 
models, one is guaranteed to produce a result beyond any given threshold of significance. 
Consequently, one of the questions lurking behind any claim of a detection of non-Gaussianity 
is how many similar models did one test.


10 Bispectral Non-Gaussianity


Although it was recognized that inflation, in its simplest form described by Einstein gravity
and a scalar field minimally coupled to gravity, would have some nonlinear corrections, early
analyses of the cosmological perturbations generated by inflation linearized about a
homogeneous solution and calculated the perturbations in the framework of a linearized (free) field
theory. According to the lore at the time, the nonlinear corrections to this approximation
would be too small to be observed. The first calculations of the leading nonlinear corrections
were given by Maldacena [131] and Acquaviva et al. [4], who calculated the bispectrum or
three-point correlations of the primordial cosmological perturbations within the framework of
single-field inflation.

Subsequent work indicated that in models of multi-field inflation, bispectral non-Gaussianity
of an amplitude much larger than for the minimal single-field inflationary models can be
obtained. The predictions of many of these multi-field models were within the range that would
be detectable by the Planck satellite. For many of these models, the non-Gaussianity, owing
to the fact that it is generated by dynamics on superhorizon scales where derivative terms are
unimportant, is well described by the “local” Ansatz for bispectral non-Gaussianity, under
which the non-Gaussian field $\zeta_{NL}$ is generated from an underlying Gaussian field $\zeta_L$ according
to the rule

$$\zeta_{NL}(x) = \zeta_L(x) + f_{NL}\, [\zeta_L(x)]^2, \tag{78}$$

where $f_{NL}$ is a dimensionless parameter that quantifies the degree of non-Gaussianity.
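The effect of the local Ansatz of Eq. (78) on the one-point statistics can be seen in a small numerical experiment. This is a sketch under illustrative assumptions: the field is represented by independent Gaussian samples with no spatial correlations, and the rms amplitude and the value of $f_{NL}$ are deliberately exaggerated, made-up numbers chosen so that the induced skewness stands out above sampling noise. The leading-order prediction $6 f_{NL} \sigma^4$ for the third central moment follows from Wick's theorem.

```python
import numpy as np

rng = np.random.default_rng(3)

def local_non_gaussian(zeta_L, f_NL):
    """Apply the local Ansatz point by point: zeta_NL = zeta_L + f_NL * zeta_L**2."""
    return zeta_L + f_NL * zeta_L**2

def third_moment(field):
    """Third central moment, which vanishes on average for a Gaussian field."""
    return np.mean((field - field.mean()) ** 3)

sigma = 0.1      # hypothetical rms of the Gaussian field (exaggerated)
f_NL = 0.5       # hypothetical nonlinearity parameter (exaggerated)
zeta_L = rng.normal(0.0, sigma, size=2_000_000)
zeta_NL = local_non_gaussian(zeta_L, f_NL)

m3_gauss = third_moment(zeta_L)
m3_local = third_moment(zeta_NL)
m3_leading = 6.0 * f_NL * sigma**4       # leading order in f_NL

print(f"Gaussian field: {m3_gauss:.3e}")
print(f"local model   : {m3_local:.3e}  (leading-order prediction {m3_leading:.3e})")
```

For the real CMB the fluctuations are of order $10^{-5}$, so the quadratic term is a tiny correction and the signal has to be extracted from the full three-point correlations rather than from a one-point skewness.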

Here $f_{NL}$ is independent of wavenumber, but a dependence on wavenumber can also be
contemplated, so that the above relation is generalized to the following (now working in
wavenumber space):

$$\zeta_{NL}(\mathbf{k}) = \zeta_L(\mathbf{k}) + \int d^3k_1\, d^3k_2\, f_{NL}(k; k_1, k_2)\, \delta^3(\mathbf{k} - \mathbf{k}_1 - \mathbf{k}_2)\, \zeta_L(\mathbf{k}_1)\, \zeta_L(\mathbf{k}_2). \tag{79}$$

Here as a consequence of the hypothesis of isotropy, we have expressed $f_{NL}$ as a function
depending only on the lengths of the vectors. Any two triangles (where the three vectors
close) that may be mapped into each other by an isometry are assigned the same $f_{NL}$.


For bispectral non-Gaussianity of the local form, as defined in Eq. (78), we briefly sketch
how the order of magnitude of $(S/N)^2$ for a detection of $f_{NL}$ from a CMB map extending
up to $\ell_{max}$ may be estimated. We simplify the estimate by employing the flat sky
approximation, allowing us to replace the unintuitive discrete sums and Wigner 3j symbols with
the more transparent continuum integrals and a two-dimensional $\delta$-function. Approximating
$C_\ell \propto \ell^{-2}$ (in other words, ignoring acoustic oscillations, damping tails, and all that, and
assuming that the CMB is scale invariant on the sky, as would occur if one could extrapolate
the simplified Sachs-Wolfe formula $(\Delta T)/T = -\phi/3$ to arbitrarily small scales), we may write

The quantity in the first line simply approximates the sum over distinguishable triangles,
the combinatorial factors being absorbed in the $O(1)$ factor, and the second line gives the
signal-to-noise squared of an individual triangle, as calculated diagrammatically. More
exact calculations taking into account more details confirm this order of magnitude estimate.
Numerical calculations are needed to determine the $O(1)$ factor and deal with $\ell_{max}$ more
carefully. Nevertheless this rough estimate reveals several characteristics of local non-Gaussianity,
namely: (i) the presence of the $\ln(\ell_{max}/\ell_{min})$ factor indicates that the bulk of the signal arises
from the coupling of large angle modes and small angle modes (i.e., the modulation of the
smallest scale power, near the resolution limit of the survey, by the very large angle modes,
i.e., $\ell = 2, 3, \ldots$), and (ii) the cosmic variance of the estimator derives from the inability to
measure the small-scale power accurately, and thus decreases in proportion to the effective
number of resolution elements of the survey. Cosmic variance on large scales is not an
issue. As long as we know the particular realization on large scales accurately (which is not
subject to cosmic variance), we know what kind of modulation to look for in the small-scale
anisotropies.
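The scaling described in this paragraph can be recovered in a few lines. The following is only a hedged sketch and is not claimed to reproduce the displayed estimate of the original text: it assumes $C_\ell = A/\ell^2$, a survey of solid angle $\Omega_{sky}$, the standard local-model form of the flat-sky bispectrum with conventions absorbed into $O(1)$ factors, and it keeps only squeezed triangles with $\ell_1 \ll \ell_2 \approx \ell_3$.

```latex
% Assumptions (not taken from the text): C_\ell = A/\ell^2, survey solid angle \Omega_{sky},
% local-model bispectrum in a normalization where convention-dependent factors are O(1).
\begin{align}
B(\ell_1, \ell_2, \ell_3) &\simeq 2 f_{NL} \left[ C_{\ell_1} C_{\ell_2} + C_{\ell_2} C_{\ell_3} + C_{\ell_3} C_{\ell_1} \right]
  \;\longrightarrow\; 4 f_{NL}\, C_{\ell_1} C_{\ell_2}
  \qquad (\ell_1 \ll \ell_2 \approx \ell_3), \\
\left( \frac{S}{N} \right)^2 &\sim O(1)\, \Omega_{sky} \int d^2\ell_1 \int d^2\ell_2\,
  \frac{B^2(\ell_1, \ell_2, \ell_3)}{C_{\ell_1} C_{\ell_2} C_{\ell_3}}
  \sim O(1)\, \Omega_{sky}\, f_{NL}^2 \int d^2\ell_2 \int d^2\ell_1\, \frac{A}{\ell_1^2} \\
  &\sim O(1)\, \Omega_{sky}\, f_{NL}^2\, A\, \ell_{max}^2\, \ln\!\left( \frac{\ell_{max}}{\ell_{min}} \right).
\end{align}
```

In this form the factor $\ell_{max}^2$ counts the small-scale modes whose power is being modulated, while the logarithm comes from the scale-invariant large-angle modes doing the modulating, in line with points (i) and (ii) above.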

Theoretical models for other shapes for the bispectral anisotropy can be motivated by
models for fundamental physics (see for example Refs. [9] and HDD and references therein) and
have been searched for as well. The Planck 2013 Results gave the following limits [177]:
$f_{NL}^{local} = 2.7 \pm 5.8$, $f_{NL}^{equil} = -42 \pm 75$, and $f_{NL}^{ortho} = -25 \pm 39$. This result rules out many of the models
developed to explain a result at low statistical significance from some analyses of the WMAP
data and was a profound disappointment to those who hoped that Planck would turn up
striking evidence against the simplest inflationary models. One may anticipate that the results
from Planck 2014 will improve modestly on the 2013 results by including more modes because
of the use of polarization. If there is a next-generation all-sky CMB polarization satellite
having good angular resolution so that $S/N \gtrsim 1$ maps may be obtained up to $\ell_{max} \sim 3000$, an
improvement on the limits on $f_{NL}$ by a factor of a few may be envisaged. However, beyond
$\ell \sim 3000$, the primary CMB anisotropies become a sideshow. As one pushes upward in $\ell$, other
nonprimordial sources of anisotropy take over, such as gravitational lensing, the
Sunyaev-Zeldovich effect, and point sources of various sorts. These contaminants have angular power
spectra that rise steeply with $\ell$ while the CMB damping tail is falling rapidly, in fact almost
exponentially. For including more modes, polarization may be helpful. On the one hand,
polarization measurements present formidable instrumental challenges because of their lower
amplitude compared to the temperature anisotropies. On the other hand, for the polarization, the ratio
of foregrounds to the primordial CMB signal is favorable up to higher $\ell$ than for temperature.
For searching for non-Gaussianity, the primordial CMB is highly linear, unlike other tracers
of the primordial cosmological perturbations. However, given that the basic observables are
two-dimensional maps, the small number of modes compared to three-dimensional tracers of
the primordial perturbations is a handicap.

11 B Modes: A New Probe of Inflation 

As discussed above, some limits on the possible contribution from tensor modes, or primordial 
gravitational waves, can be obtained from analyzing the $C^{TT}$ power spectrum. However,
given that r is not large, these limits become highly model dependent, and because of cosmic
variance as well as uncertainty as to the correct parametric model, cannot be improved to
any substantial extent. This difficulty is illustrated by the results from the Planck 2013 
analysis m, which did not include the data on polarized anisotropies collected by Planck
but only the TT power spectrum. If one assumes the minimal six-parameter model plus
r added as an extra parameter, a limit of r < 0.11 (95% C.L.) is obtained. However, if
the primordial power spectrum in this model (where a form $P(k) \sim k^{n_s-1}$ is assumed) is
generalized to $P(k) \sim (k/k_*)^{n_s-1} \exp(\alpha (\ln(k/k_*))^2)$, the limits on r loosen by about a factor
of two to become r < 0.2 (95% confidence level). This lack of robustness to the assumptions 
of the parametric model highlights the fragility of this approach and demonstrates that the 
statistical error bars are not to be trusted unless one has strong reason to trust the underlying 
parametric model. However, given our current understanding of inflationary potentials and 
dynamics, we have no reason to trust any of the parametric models, which serve more as 
fitting functions for summarizing the current state of the observations. 
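
To make the two parametrizations above concrete, the following Python sketch evaluates both forms of P(k). The pivot scale `k_star`, the tilt `n_s = 0.96`, the amplitude, and the curvature parameter `alpha` are illustrative assumptions chosen for this example; they are not values taken from the Planck analysis.

```python
import math


def power_law(k, k_star=0.05, n_s=0.96, amplitude=1.0):
    """Pure power law: P(k) proportional to (k/k_*)^(n_s - 1)."""
    return amplitude * (k / k_star) ** (n_s - 1.0)


def power_law_with_curvature(k, k_star=0.05, n_s=0.96, alpha=0.0, amplitude=1.0):
    """Generalized form: P(k) ~ (k/k_*)^(n_s - 1) * exp(alpha * (ln(k/k_*))^2)."""
    log_ratio = math.log(k / k_star)
    return power_law(k, k_star, n_s, amplitude) * math.exp(alpha * log_ratio ** 2)


if __name__ == "__main__":
    # Compare the two forms on large scales (small k), where tensor modes matter.
    for k in (0.002, 0.01, 0.05):
        print(k, power_law(k), power_law_with_curvature(k, alpha=-0.01))
```

With `alpha = 0` the two forms coincide. A negative `alpha` suppresses scalar power away from the pivot, which is the kind of extra freedom that can trade off against a tensor contribution on large scales and loosen the limit on r.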

Searching for tensor modes using B modes, on the other hand, allows us to probe much 
lower values of r in a manner that is substantially model independent because in the linear 
theory “scalar” perturbations cannot generate any B modes, and this conclusion is independent 
of the parametric model assumed. 

11.1 Suborbital searches for primordial B modes 

In March 2014, the BICEP2 Collaboration [17], [18] claimed a detection of primordial gravitational
waves based primarily on observations at a single frequency (150 GHz) over a small
patch of the sky (1% of the sky) that was believed to be of particularly low polarized dust 
emission. Using a telescope based at the South Pole, the BICEP2 team created a map at 
150 GHz of the polarized sky emission from which the B mode contribution was extracted. 
The BICEP2 analysis included many null tests to estimate and exclude systematic errors as 
well as to estimate the statistical noise in the measurement. However, in order to claim a detection
of primordial gravitational waves, the BICEP2 team had to exclude a nonprimordial
origin for the observed B mode signal, for which the main suspect would be polarized thermal 
dust emission. An independent analysis M called into question the BICEP2 estimate of the 
likely dust contribution in their field, claiming that all the observed signal could be attributed
to dust. A subsequent Planck analysis estimating the dust contribution in the BICEP2 field
by extrapolating polarized dust maps at higher frequencies where dust is dominant down
to 150 GHz confirmed this finding [163]. While this analysis does not necessarily exclude a
nonzero primordial contribution to the B modes observed by BICEP2 at 150 GHz, the argument
crucial to the BICEP2 claim that a dominant contribution from dust can be excluded
collapses in light of this new finding from Planck concerning the expected contribution from
thermal dust emission. A cross-correlation study is now underway as a joint project of the
two teams that will combine the BICEP2 map at 150 GHz with the Planck polarization maps
at higher frequencies (which are dominated by dust). It remains to be seen whether this effort
can establish a limit on r better than what is currently available or whether this combined 
analysis might even result in a detection of primordial B modes albeit at a lower level of r. 

In any case, a number of experiments are now underway involving observations at multiple 
frequencies that will push down the limits on r using B modes if not result in a first detection 
if r is not too small. The BICEP2 team has a series of upgrades to their experiment expanding 
their frequency coverage as well as massively increasing the number of detectors and hence 
their sensitivity. Other competing efforts include POLARBEAR [8], SPIDER [34] and QUBIC 
[13], as well as ACTPol and SPTPol. 
Moreover, farther into the future, a more ambitious upgrade to ground based efforts is
contemplated through the US Stage 4 (S4) [1], [2] CMB experiment currently under consideration
by the US DOE. S4 contemplates deploying a total of approximately $2 \times 10^5$ detectors
from the ground in order to achieve a massive improvement in raw sensitivity. Although it is 
claimed that an improvement in the control of systematic errors to a comparable level can be 
achieved, it remains to be seen whether S4 can realize its projected performance. 

11.2 Space based searches for primordial B modes 

Several groups around the world have proposed space missions specifically dedicated to mapping
the microwave polarization over the full sky with a sensitivity that would permit the
near ultimate measurement of the CMB B modes. In Europe, three ESA missions had been
proposed: BPol in 2007 [43], COrE [33] in 2010, both of which were medium-class, and the
large-class mission PRISM [189] in 2013. None of these missions was selected. A mission proposal
called COrE+ is currently in the process of being submitted. In Japan a mission called
LiteBird [84] has been developed, but final approval is still pending. In the US, CMBPol/EPIC
[61] has proposed a number of options for a dedicated CMB polarization space mission,§ but
none of these has been successful in securing funding. All the above concepts involve a focal
plane including several thousand single-mode detectors in order to achieve the required
sensitivity. Another proposal endeavors to achieve the needed sensitivity by another means:
multi-moded detectors. In the Pixie proposal, a Martin-Puplett Fourier spectrometer
comparing the sky signal to that of an artificial blackbody is proposed. The setup is similar
to the COBE FIRAS instrument, except that polarization sensitive bolometers and a more
modern technology are used, which would allow a measurement of the
absolute spectrum about two orders of magnitude better than FIRAS.

12 CMB Anomalies 

The story recounted so far has emphasized the agreement of the observations accumulated over 
the years with the six-parameter so-called “concordance” model. But this would not be a fair 
and accurate account without reporting a few wrinkles to this remarkable success story. These 
wrinkles may either be taken as hints of new physics, or discounted as statistical flukes, perhaps 
using the pejorative term a posteriori statistics. On the one hand, one can argue that if one 
looks at enough models that are in some sense independent, one is bound to turn up something 
at high statistical significance, and by most standards the significance of these anomalies is 
not high. Which interpretation is preferred is presently under debate. Ultimately, despite all 
the fancy statistical terminology used in this discourse, what one concludes inevitably relies 
on a subjective judgment of theoretical plausibility. 

One anomaly explored over the years, known under several names including “hemispherical
asymmetry,” “dipolar modulation,” and “bipolar disorder,” asks the question whether
the local angular power spectrum is identical when compared between opposite directions on
the celestial sphere. There is no unique way to pose this question precisely. The statistical
significance, amplitude, and direction of the asymmetry depend somewhat on the precise formulation
chosen. However, the WMAP finding of a dipolar modulation with an amplitude
of ≈ 7% and a significance ranging within about 2–4σ, and pointing approximately toward
(l, b) = (237°, 20°) in galactic coordinates is broadly confirmed by the Planck 2013 Results.
This confirmation by Planck, owing to the additional high frequency coverage, renders less
plausible a nonprimordial explanation based on foreground residuals. The sensitivity to dipolar
modulation on these angular scales is primarily limited by cosmic variance for both WMAP
and Planck, so future experiments cannot hope to improve substantially on the significance of
these results. Planck, however, because of its superior angular resolution, was able to probe
for dipolar modulation pushing to smaller scales, where cosmic variance is less of a problem
and a possible modulation can be constrained more tightly. [The measurement uncertainty
in the scale-dependent dipole modulation of the amplitude of the perturbations $\Delta A(\ell)$ (assuming
broad binning) scales as $1/\ell$ as long as cosmic variance is the limiting factor.] The
Planck 2013 data find no evidence of dipolar modulation at the same amplitude extending to
small angular scales, suggesting that if the dipolar modulation observed by WMAP is not a
statistical fluke, a scale-dependent theoretical mechanism for dipolar modulation is needed.
Moreover, Planck does not see evidence of modulation associated with higher multipoles (e.g.
quadrupolar and higher order disorders). This last point is important for constraining theoretical
models producing statistical anisotropy by means of extra fields disordered during
inflation.

§ See http://cmbpol.uchicago.edu/papers.php for a list of papers on this effort.

Another anomaly is the so-called “cold spot.” Rather than limiting ourselves to two-point
statistics, or to three-point statistics, searching for a bispectral signal using theoretically
motivated templates, as described in Sec. 10, we may ask whether the most extreme values of
an appropriately filtered pure CMB map lie within the range that may be expected assuming
an isotropic Gaussian stochastic process, or whether their p-values (i.e., probability to exceed)
render the Gaussian explanation implausible. There are many ways to formulate questions of
this sort, a fact that renders the assessment of statistical significance difficult. One approach
that has been applied to the data is to filter the sky maps with a spherical Mexican hat wavelet
(SMHW) filter, which on the sphere would have the profile of the Laplacian operator applied
to a Gaussian kernel of a given width $\sigma$, giving a broad [i.e., $(\Delta\ell)/\ell = O(1)$] two-dimensional
spatial bandpass filter. (See Ref. [224] for more details concerning the methodology and
Ref. [225] for the original claim of a cold spot detection. See also Ref. [257] for an assessment
by the WMAP team as well as Ref. mg for the analysis from Planck 2013.) 
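
The filter profile described above can be illustrated in the flat-sky limit, where the Mexican hat is minus the two-dimensional Laplacian of a Gaussian of width sigma. The sketch below is only this small-angle approximation, not the spherical wavelet actually applied to the data, and the function names and normalization are choices made for this example.

```python
import math


def mexican_hat_flat(r, sigma):
    """Flat-sky Mexican hat: minus the 2D Laplacian of exp(-r^2 / (2 sigma^2)).

    Up to normalization this is (2 - r^2/sigma^2) exp(-r^2 / (2 sigma^2)) / sigma^2.
    """
    u = (r / sigma) ** 2
    return (2.0 - u) * math.exp(-0.5 * u) / sigma ** 2


def integral_over_plane(sigma, r_max_in_sigma=12.0, n_steps=20000):
    """Midpoint-rule integral of the profile over the plane (2 pi r dr)."""
    dr = r_max_in_sigma * sigma / n_steps
    total = 0.0
    for i in range(n_steps):
        r = (i + 0.5) * dr
        total += mexican_hat_flat(r, sigma) * 2.0 * math.pi * r * dr
    return total


if __name__ == "__main__":
    sigma = 1.0
    print(mexican_hat_flat(0.0, sigma))                   # positive central lobe
    print(mexican_hat_flat(math.sqrt(2.0) * sigma, sigma))  # zero crossing
    print(integral_over_plane(sigma))                     # close to zero
```

The profile integrates to zero over the plane, so the filter removes the mean and the largest scales, while the Gaussian envelope suppresses small scales. This is what makes it a broad bandpass filter centered on angular scales of order sigma.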

The final anomaly concerns possible alignments of the low-$\ell$ multipoles, imaginatively
named the “axis of evil” by Land and Magueijo m ■ A Gaussian isotropic theory for the 
temperature anisotropy† predicts that the multipole vectors are each Gaussian
and independently distributed. We may combine three or more such vectors according
to the tensor product of representations of SO(3)

$$L_1 \otimes L_2 \otimes \cdots \otimes L_N, \tag{81}$$

which may be decomposed into a direct sum of irreducible representations. Provided that the
triangle inequality is satisfied, we may extract one or more scalars (transforming according
to $L = 0$) from the above tensor product. The mathematics is simply that of the usual
addition of angular momentum. For each of these scalars, we may ask whether the observed
value lies within the range expected from cosmic variance in the framework of a Gaussian
theory taking into account the uncertainty in the determination of the underlying power
spectrum. The approach just described was not the approach of Land and Magueijo, who
maximized $\max_{m \in [-\ell, +\ell]} |a_{\ell m}(\hat{n})|$ as a function of $\hat{n}$, where $a_{\ell m}(\hat{n})$ is the expansion coefficient
in the coordinate system with $\hat{n}$ pointing to the north pole. In this way, for each generic value
of the multipole number $\ell$ a unique double headed vector, or axis, $\pm\hat{n}$ may be extracted, and
the alignments between these axes may be assessed. Land and Magueijo report an alignment
of the $\ell = 3$, $\ell = 4$, and $\ell = 5$ axes with a p-value less than $10^{-3}$ in the Gaussian theory.
Accurate p-values may be obtained for each of these invariant quantities by resorting to MC
simulations to account for the practicalities of dealing with a cut sky, etc. However, there
still remains a subjective element to assessing statistical significance. One may ask how many
invariants were tried before arriving at a reportable statistically significant anomaly.

† The discussion can readily be generalized to polarization, but we shall stick to the temperature anisotropy
alone in our discussion.
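
The concern about how many invariants were tried can be quantified under a simplifying assumption. The sketch below assumes that the statistics examined are mutually independent, which real CMB invariants generally are not, so it is a rough guide rather than a correction that could be applied to a published p-value. The function names are hypothetical.

```python
import math


def prob_at_least_one(p_single, n_trials):
    """Chance that at least one of n independent statistics has p-value <= p_single."""
    return 1.0 - (1.0 - p_single) ** n_trials


def trials_for_even_odds(p_single):
    """Number of independent statistics needed for a 50% chance of one such 'anomaly'."""
    return math.log(0.5) / math.log(1.0 - p_single)


if __name__ == "__main__":
    p = 1.0e-3  # nominal p-value of the reported alignment
    for n in (1, 10, 100, 1000):
        print(n, prob_at_least_one(p, n))
    print(trials_for_even_odds(p))
```

Under this independence assumption, a nominal p-value of $10^{-3}$ becomes roughly a one-in-ten occurrence once about a hundred statistics have been examined.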


13 Sunyaev-Zeldovich Effects 


In the simplified discussion of reionization presented in Sec. 7.5, it was assumed that the
electrons responsible for the rescattering are at rest with respect to the cosmic rest frame. This
approximation treats the reionized gas as a cold plasma having a vanishing peculiar velocity
field. As pointed out by Sunyaev and Zeldovich [215], corrections to this approximation arise
in two ways: (i) In the so-called thermal Sunyaev-Zeldovich effect (tSZ), the random thermal
motions of the free electrons alter the spectrum of the rescattered CMB photons through the
Doppler effect and Compton recoil. Because $\langle v \rangle = 0$, this effect is second order in the velocity,
or linear in the electron temperature, and results in a spectral distortion not respecting the
frequency dependence of a blackbody spectrum with a perturbed temperature. (See Fig. 19.)
(ii) In the kinetic Sunyaev-Zeldovich (kSZ) effect the peculiar velocity of the gas results in a
shift in temperature proportional to the component of the peculiar velocity along the line of 
sight. The kSZ effect has the same frequency dependence as the underlying primary CMB 
perturbations, thus making it hard to detect given its small magnitude. There is also a 
much smaller effect where the transverse peculiar velocity imparts a linear polarization to the 
scattered photons for which the polarization tensor $P_{ij}$ is proportional to $(\mathbf{v}_\perp \otimes \mathbf{v}_\perp - \frac{1}{2} v_\perp^2 \delta_\perp)$,
where $\mathbf{v}_\perp$ is the peculiar velocity perpendicular to the line of sight and $\delta_\perp$ is the Kronecker
delta function in the plane perpendicular to the line of sight.

Galaxy clusters are filled with hot gas [$T_{\rm gas} = O(10\ \mathrm{keV})$] that emits primarily at X-ray
frequencies. The hot electrons of this fully ionized gas, or plasma, scatter CMB photons
by Thomson, or Compton, scattering, shifting their frequency by a factor of approximately
$(1 + \beta \cos\theta)$ where $\beta = k_B T/m_e c^2$ and we have ignored higher order corrections in $\beta$. The
y-distortion parameter along a line of sight is given by the integral

$$y = \int dl\, n_e \sigma_T \frac{k_B T_e}{m_e c^2}, \tag{82}$$

and the fractional perturbation in the CMB thermodynamic temperature in the nonrelativistic
approximation is given by

$$\frac{\delta T_{\rm CMB}(\hat{\Omega}, \nu)}{T_{\rm CMB}} = y(\hat{\Omega}) \left( x\, \frac{e^x + 1}{e^x - 1} - 4 \right), \tag{83}$$

where $x = h\nu/k_B T_{\rm CMB} = \nu/(57\ \mathrm{GHz})$ and $y(\hat{\Omega})$ is the y-distortion map as defined by the line
of sight integral in Eq. (82).
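
As a numerical check on the frequency dependence, the sketch below assumes the standard nonrelativistic form of the thermal SZ spectral factor, $f(x) = x\,(e^x + 1)/(e^x - 1) - 4$ with $x = h\nu/k_B T_{\rm CMB}$, and locates its zero by bisection. The function names and the search bracket are choices made for this example.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
K_BOLTZMANN = 1.380649e-23  # J / K
T_CMB = 2.725               # K


def dimensionless_frequency(nu_ghz):
    """x = h nu / (k_B T_CMB) for a frequency given in GHz."""
    return H_PLANCK * nu_ghz * 1.0e9 / (K_BOLTZMANN * T_CMB)


def tsz_spectral_factor(nu_ghz):
    """Nonrelativistic tSZ factor f(x) = x (e^x + 1) / (e^x - 1) - 4."""
    x = dimensionless_frequency(nu_ghz)
    return x * (math.exp(x) + 1.0) / (math.exp(x) - 1.0) - 4.0


def tsz_null_ghz(lo=100.0, hi=400.0, tol=1.0e-6):
    """Bisection for the frequency where the factor changes sign."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if tsz_spectral_factor(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    print(tsz_spectral_factor(30.0))   # close to the R-J limit of -2
    print(tsz_null_ghz())              # crossover between decrement and increment
```

The crossover comes out near 217 GHz, consistent with the frequency quoted in the caption of Fig. 19, and the factor tends to -2 at low frequency.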


Figure 19: Frequency dependence of thermal Sunyaev-Zeldovich spectral distortions. The solid
and dashed curves show the fractional spectral distortion in terms of the CMB thermodynamic
and R-J temperatures, respectively. For low frequencies ($\nu < 217$ GHz), there is a decrement
in temperature, corresponding to the fact that low frequency photons are on the average
Doppler shifted to higher frequencies by the hot electrons. On the other hand, for $\nu > 217$ GHz
the net effect is to increase the photon phase space density.


On the one hand, the thermal Sunyaev-Zeldovich effect provides a powerful probe of the dynamics
of galaxy clusters, which can be used to discover new clusters and to probe the structure
of known clusters away from their central core. On the other hand, for observing the primordial
CMB, the thermal Sunyaev-Zeldovich effect constitutes a contaminant that must either
be removed or modelled. The review by Birkinshaw [19] recounts the early history of tSZ
measurements. See also the reviews by Carlstrom et al. [29] and by Rephaeli [195]. For the
observation of the kSZ, see Ref. [80]. For more recent measurements, see the SZ survey papers
from the ACT [133], SPT [252], and Planck [T73i jr7%] collaborations. See Ref. [159] for a
discussion of what might ultimately be possible from space in the future.

14 Experimental Aspects of CMB Observations


Like astronomical observations at other wavelengths, most modern microwave experiments
consist of a telescope and a number of detectors situated at its focal
plane. The COBE experiment was an exception, as it used single-moded feed horns pointed
directly at the sky to define its beam, which was relatively broad (7° FWHM). However, for
measurements at higher angular resolution, it is difficult to construct a feed horn forming a 
sufficiently narrow beam on the sky without the help of intermediate optics, unless one resorts 
to interferometry as for example in the DASI experiment. 

Observations of the sky at microwave frequencies present a number of challenges unique to
this frequency range. Unlike at other frequencies, the microwave sky is remarkably isotropic.
The CMB monopole moment, and to a much lesser extent the dipole moment, dominates.
In the bands where other contaminants contribute least (that is, roughly in the range
70-150 GHz), virtually all the photons collected result from the isotropic 2.725 K background.
The brightest feature superimposed on this uniform background is the CMB dipole, resulting
from our proper motion with respect to the cosmic rest frame, at the level of 0.1%. When
this dipole component is removed, the dominant residual is the primordial CMB anisotropy,
whose amplitude on large angular scales is about 35 μK, or roughly $10^{-5}$. This situation is
to be contrasted for example with X-ray astronomy where only some 50 photons need to be
collected to make an adequate galaxy cluster detection. For the CMB about $10^{10}$ photons
must be collected in each pixel just to obtain the temperature anisotropy at a signal-to-noise
ratio of one, and many more photons are needed for measuring polarization at the
same marginal signal-to-noise. The challenge is to measure minute differences in temperature
between different points on the celestial sphere without introducing spurious effects. These
can arise from a variety of sources: drifts in the zero point of the detector, detector noise, hot
objects in the far sidelobes of the beam, and diffraction around the edges of the mirrors, to
name some of the most common problems.

Figure 20: Atmospheric optical depth as a function of frequency for 1 mm of precipitable
water vapor. Water vapor is an important but not the only source of atmospheric
contamination. See Ref. [148] for more details. A column density of 0.5-1 mm of precipitable
water vapor corresponds to good conditions in the Atacama desert. Conditions at the South
Pole are often better by a factor of a few. The black curve shows the opacity, or optical depth,
and the red curve shows the opacity multiplied by 1/20 in order to highlight the structure
around the resonances. The limited transmission $0 < t < 1$ [where $t = \exp(-\tau)$], on the one
hand, attenuates the desired sky signal. But more serious is the superimposed additive noise,
having a brightness temperature $(1 - t)T_{\rm atm}$. Here $T_{\rm atm}$ is the effective temperature of the
atmosphere. ( Credit: J. Cernicharo )

For this reason, almost all measurements of the microwave sky are in some sense differential. 
Drifts in the zero point of the detector (known as 1/f noise) can be mitigated by rapidly
switching either between the sky and an artificial cold load with a long time constant or 
between different parts of the sky. This can be accomplished by moving the beam rapidly 
across the sky, so that in one way or the other the basic observable becomes differences in 
intensity of pixels whose measurements are closely separated in time. The point to take away 
is that one absolutely avoids making direct rather than differential measurements of the sky 
temperature. 

Another specificity of astronomical observations at microwave frequencies is that extremely 
stringent requirements must be imposed limiting the magnitude of the far sidelobes of the 
beam. Given the smallness of the differences in temperature of interest, contamination from
the ground, the earth, the sun, and moon, and also from the galaxy near the galactic plane,
can easily introduce spurious signals if the beam does not fall sufficiently rapidly away from
its central peak. Ground pickup (from a solid angle of $2\pi$) is particularly challenging
to shield, and this is why the WMAP and Planck satellites were situated at L2 (the second
Lagrange point, where the earth and the moon have a tiny angular diameter) rather than in
a low-earth orbit, which would equally well avoid atmospheric emission, which is the other
major motivation for going to space. The second earth-sun Lagrange point (L2) is $1.5 \times 10^6$ km
from the earth—that is, about 4 times the earth-moon distance. From L2 the earth subtends 
an angle of only 16 arcminutes. More importantly from L2, the sun, earth, and moon all 
appear at approximately the same position in the sky making it easier to keep away from the 
far sidelobes of the beam. 

A formidable challenge for suborbital observations is emission from the atmosphere, primarily
but not exclusively from water vapor. The emission from the atmosphere has a complicated
frequency dependence, as shown in Fig. 20, where the optical depth as a function of
frequency is shown. This optical depth must be multiplied by the atmospheric temperature
(i.e., $\sim 270$ K) to obtain the sky brightness. The structure in frequency of the regions of
high opacity implies that bands must be carefully chosen to avoid emission lines and bands, 
giving less flexibility for wide frequency coverage than from space. The problems arising from 
atmospheric emission can be mitigated to some extent by choosing a site on the ground with 
a particularly small water column density, such as the South Pole or the Atacama desert in 
Chile, or by observing from a stratospheric balloon. Atmospheric emission is problematic in 
two respects. First, if the atmospheric emission were stable in time, it would merely introduce 
additional thermal loading on the detectors, which could be mitigated simply by deploying 
more detectors or by observing over a longer time. If the contribution from the sky is $T_{\rm sky}$,
then the fluctuations in intensity at the detector (assumed perfect for the moment) would
increase by a factor of $(T_{\rm sky} + T_{\rm CMB})/T_{\rm CMB}$, meaning that we could measure the sky temperature
with the same error by increasing either the number of detectors or the total observation
time by a factor of $((T_{\rm sky} + T_{\rm CMB})/T_{\rm CMB})^2$. At 150 GHz from the South Pole, for example, to
choose a band where the sky brightness temperature is not very high, one has $T_{\rm sky} \approx 16$ K.
This means that under the idealized assumptions made here, almost 40 times more detectors
would be required than for a space-based experiment. But the situation is worse than this,
because the atmosphere contamination is not stable in time but rather fluctuates in a complicated
way characterized by a wide range of time scales (owing to the underlying turbulence).
Moreover, the atmospheric load varies with zenith angle, roughly according to a secant law. 
Of course, observations from the ground are attractive because of their low cost, the fact that 
telescopes can be improved from season to season to address problems as they arise, and larger 
telescopes with many more detectors can be deployed. As experiments increase in sensitivity, 
requirements for mitigating these problems will become more stringent. At present it is not
known what the limit is on the quality of measurements that can be achieved from the ground
or from balloons.
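
The idealized loading argument above can be sketched in a few lines of Python. The atmospheric temperature of 270 K and the sky brightness of 16 K are the round numbers quoted in the text; the optical depth of 0.06 is an assumed value chosen only because it reproduces a loading of about 16 K, and the function names are hypothetical.

```python
import math

T_CMB = 2.725  # K


def atmospheric_brightness(tau, t_atm=270.0):
    """Additive sky brightness (1 - t) T_atm with transmission t = exp(-tau)."""
    transmission = math.exp(-tau)
    return (1.0 - transmission) * t_atm


def detector_penalty(t_sky, t_cmb=T_CMB):
    """Factor by which detector count or observing time must grow at fixed error,
    ((T_sky + T_CMB) / T_CMB)^2, for a stable atmosphere and perfect detectors."""
    return ((t_sky + t_cmb) / t_cmb) ** 2


def tau_at_zenith_angle(tau_zenith, zenith_angle_deg):
    """Secant-law scaling of the optical depth with zenith angle (plane-parallel atmosphere)."""
    return tau_zenith / math.cos(math.radians(zenith_angle_deg))


if __name__ == "__main__":
    t_sky = atmospheric_brightness(0.06)
    print(t_sky, detector_penalty(16.0))
    print(tau_at_zenith_angle(0.06, 60.0))
```

With these round inputs the penalty is a few tens, and observing at a zenith angle of 60 degrees doubles the optical depth. The estimate ignores the temporal fluctuations of the atmosphere, which the text identifies as the more serious problem.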


14.1 Intrinsic photon counting noise: ideal detector behavior 

In radio astronomy (where the frequency of observations is typically lower than for CMB
observations) the following expression, known as the radio astronomers’ equation [45], [111],
gives the fractional error of an observation under slightly idealized assumptions:

$$\frac{\delta I}{I} = \left( \frac{T_{\rm sky} + T_{\rm sys}}{T_{\rm sky}} \right) \frac{1}{\sqrt{(\Delta B)\, t_{\rm obs}}}. \tag{84}$$

Here $t_{\rm obs}$ is the time of observation of a pixel and $(\Delta B)$ is the bandwidth. $T_{\rm sky}$ is the sky
brightness temperature averaged over the pixel and $T_{\rm sys}$ represents the additional noise introduced
within the receiver. This expression can be understood by considering the source
in the sky as having thermal statistics, that is, fluctuating in intensity as a Gaussian random
field. In radio astronomy, detection is coherent and each detector picks up only a single
transverse mode of the electromagnetic field incident from the sky, which is converted by an
antenna or a feed horn into an electrical signal, which depends only on time. This signal is
coherently amplified and nowadays digitized. Mathematically, in the absence of additional
detector noise, the sky signal can be thought of as a signal whose complex amplitude fluctuates
according to a Gaussian distribution with a variance proportional to $T_{\rm sky}$. The coherence
time is $1/(\Delta B)$, so in a time interval $t_{\rm obs}$, $N_{\rm samp} = 2(\Delta B) t_{\rm obs}$ independent realizations of this
Gaussian random process whose variance we are trying to measure are collected, leading to a
fractional error in the determination of the variance, or $T_{\rm sky}$, of $1/\sqrt{N_{\rm samp}}$. For a more realistic
measurement, noise is also introduced, and in the above formula the noise is idealized as an
independent additive Gaussian random signal, characterized in terms of a system temperature
$T_{\rm sys}$, to be added to $T_{\rm sky}$ to obtain the variance actually measured. The additive noise lumped
together and known as the “system” temperature includes thermal emission from the atmosphere,
Johnson noise from dielectric losses in the feedlines, and noise in the detector itself: in
other words, everything other than the sky temperature that one would measure from space
with an ideal measuring device.
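
A short numerical sketch may help fix orders of magnitude. It assumes the standard form of the radio astronomers' equation, $\delta I/I = [(T_{\rm sky} + T_{\rm sys})/T_{\rm sky}] / \sqrt{(\Delta B)\, t_{\rm obs}}$, and treats several identical, independent detectors as simply multiplying the effective observing time. The 30 GHz bandwidth is an arbitrary illustrative value, and the function name is hypothetical.

```python
import math


def radiometer_fractional_error(t_sky, t_sys, bandwidth_hz, t_obs_s, n_detectors=1):
    """Classical fractional error on the sky intensity.

    Assumes identical, independent detectors, so that n_detectors
    acts like a longer integration time.
    """
    n_eff = bandwidth_hz * t_obs_s * n_detectors
    return (t_sky + t_sys) / t_sky / math.sqrt(n_eff)


if __name__ == "__main__":
    t_cmb = 2.725  # K
    ideal = radiometer_fractional_error(t_cmb, 0.0, 30.0e9, 1.0)
    matched = radiometer_fractional_error(t_cmb, t_cmb, 30.0e9, 1.0)
    print(ideal)            # noiseless receiver, one second of integration
    print(matched / ideal)  # penalty when T_sys equals T_sky
```

When the system temperature equals the sky temperature, the error is only twice that of a noiseless receiver, which illustrates why lowering the system temperature well below the source brightness brings little further gain.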

The above formula was derived treating the incoming electromagnetic field as entirely
classical, and for a thermal source this treatment is valid in the R-J part of the spectrum, that
is, when $T_{\rm source} \gg h\nu/k_B$. For the CMB, with $T_{\rm CMB} = 2.725$ K, this requires that
$\nu \ll \nu_{\rm CMB} = k_B T_{\rm CMB}/h = 57$ GHz. Since CMB observations are typically made at comparable
or higher frequencies, quantum effects introduce significant corrections to the
above formula, but one is still far from the regime where photons arrive in a nearly uncorrelated
way, obeying Poissonian counting statistics, which is what happens for observations in the
extreme Wien tail of the blackbody distribution. Before introducing the quantum corrections
to Eq. (84), let us finish our discussion of this result based on classical theory extracting all
the lessons to be learnt from this result. In the R-J regime, where $N \gg 1$, N being the
photon occupation number, roughly speaking, photons arrive in bunches of N photons, so the
discreteness of the photons is not an issue. As long as a fraction greater than about $1/N$ of the
photons is captured, little additional noise is introduced. The intrinsic noise of the incoming
electromagnetic field is almost entirely classical in origin and can be modeled faithfully using
stochastic classical electromagnetic formalism. (For a more detailed discussion of these issues,
see Ref. rao.) From the above formula, we learn that noise can be reduced by the following
measures: (i) choosing as wide a bandwidth as possible, (ii) increasing the observation time,
and (iii) increasing the number of detectors. We also learn that although lowering the internal
noise of the detection system increases the accuracy of the measurements, once the system
temperature is approximately equal to the brightness temperature of the source, lowering the
system temperature further leads only to marginal improvement in sensitivity.

We now derive the quantum corrected version of the radio astronomers' equation, using
a Planck distribution instead of the classical description based on a stochastic
Gaussian field. For the Planck distribution, where $p_N = x^N (1 - x)$ with $x = \exp(-h\nu/k_B T)$,

$$\langle (\delta N)^2 \rangle = \langle N \rangle^2 + \langle N \rangle = N^2 + N. \tag{85}$$

In the R-J regime, where $N \gg 1$, the first term dominates, reproducing the result obtained
using a classical random field as described above. But as the frequency is increased and one
starts to pass toward the Wien tail of the distribution, the second term increases in importance.
In the extreme Wien tail, where the second term dominates, we observe the fluctuations
characterized by Poisson statistics, where the (rare) arrivals of photons are completely uncorrelated.
This means that the fractional error for the number of photons counted must be
increased by a factor of

$$\sqrt{1 + N^{-1}} = \exp\left[ \frac{h\nu}{2 k_B (T_{\rm sky} + T_{\rm sys})} \right], \tag{86}$$

and the quantum-corrected version of Eq. (84) becomes [113]

$$\frac{\delta I}{I} = \left( \frac{T_{\rm sky} + T_{\rm sys}}{T_{\rm sky}} \right) \frac{1}{\sqrt{(\Delta B)\, t_{\rm obs}}} \exp\left[ \frac{h\nu}{2 k_B (T_{\rm sky} + T_{\rm sys})} \right]. \tag{87}$$
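
The size of the quantum correction can be checked numerically. The sketch below evaluates the mean occupation number of a Planck distribution, the variance $N^2 + N$, and the factor $\sqrt{1 + N^{-1}} = \exp(h\nu/2k_BT)$, taking the total temperature to be that of the CMB alone ($T_{\rm sys} = 0$), which is an assumption made only for this illustration. The function names are hypothetical.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
K_BOLTZMANN = 1.380649e-23  # J / K


def occupation_number(nu_hz, temperature_k):
    """Mean photon occupation number N = 1 / (exp(h nu / k_B T) - 1)."""
    return 1.0 / math.expm1(H_PLANCK * nu_hz / (K_BOLTZMANN * temperature_k))


def photon_number_variance(n_mean):
    """Variance of the photon number for a Planck distribution: N^2 + N."""
    return n_mean ** 2 + n_mean


def quantum_correction(nu_hz, temperature_k):
    """Factor sqrt(1 + 1/N) = exp(h nu / (2 k_B T)) multiplying the classical error."""
    return math.exp(H_PLANCK * nu_hz / (2.0 * K_BOLTZMANN * temperature_k))


if __name__ == "__main__":
    for nu_ghz in (10.0, 57.0, 150.0, 353.0):
        n_mean = occupation_number(nu_ghz * 1.0e9, 2.725)
        print(nu_ghz, n_mean, quantum_correction(nu_ghz * 1.0e9, 2.725))
```

At 10 GHz the occupation number is well above one and the correction is under ten percent, whereas at 150 GHz the occupation number is below one tenth and the correction is a factor of several, so the classical formula noticeably underestimates the photon noise in the main CMB bands.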


14.2 CMB detector technology 

Having analyzed the performance of an ideal detector limited only by the intrinsic fluctuations
of the incoming electromagnetic field, we now review the state of the art of existing detector
technologies, which may be divided into two broad classes: (i) coherent detectors, and (ii) 
incoherent detectors. 

Coherent detectors transfer the signal from a feed horn or antenna onto a transmission line
and then coherently amplify the signal using a low-noise amplifier, generally using high electron
mobility transistors (HEMTs) and switching with a cold load or between different points
on the sky in order to mitigate 1/f noise [187]. Coherent detection is an older technology.
Coherent detectors have the advantage that they can operate at much higher temperatures
than the competing incoherent bolometric detectors. Avoiding active cooling through a cryogenic
system is a definite plus, especially in space, where reducing risk of failure is important.
However, unlike their incoherent competitors, at the high frequencies of interest for CMB
observations, coherent detectors have noise levels far above the quantum noise limit, and until
now coherent detectors have not been able to reach frequencies above circa 100 GHz, although
some believe that this situation may improve. It is not presently known how coherent amplification
would function in the Wien part of the blackbody spectrum. Coherent detectors were
used for the COBE DMR experiment, for WMAP (including the frequencies 20, 30, 40, 60,
and 90 GHz), and the Planck LFI (low frequency instrument), which included the frequencies
30, 44, and 70 GHz. One advantage of coherent detectors is their insensitivity to cosmic rays.

Another property of coherent detectors is that for a single mode measurement, all four 
Stokes parameters (i.e., I, Q, U, V) can be measured simultaneously. For incoherent detectors 
one can measure only two Stokes parameters simultaneously. However for measurements 
outside the R-J part of the spectrum, this advantage of coherent detectors disappears, because 
in the Wien part of the spectrum, the photons are not bunched to any appreciable degree. On 
the other hand, for coherent detectors, it is not possible to construct multi-moded detectors, 
where the number of detectors can be greatly reduced by sacrificing angular resolution, but 
not at the cost of less sensitivity on large angular scales. Coherent detectors are always
single-moded, sampling at the diffraction limit. 

Incoherent detectors, unlike their coherent counterparts, do not attempt to amplify the 
incident electromagnetic wave. Rather the incident electromagnetic wave, or equivalently the 
stream of photons, is directly converted into heat, changing the temperature of a small part 
of the detector of small heat capacity, whose temperature is monitored using a thermistor of 
some sort. Typically the detector is cooled to an average temperature much lower than that of 
the incident radiation. The bolometers of the Planck high frequency instrument (HFI) were 
cooled to 0.1 K. Consequently each CMB photon produces many phonons in the detector, 
and therefore the counting statistics of the phonons do not to any appreciable extent limit the 
measurement. Heat flows from the sky onto the detector and is then conducted to a base plate 
of high heat capacity and thermal stability. A thermal circuitry (which can be modelled as a 
sort of RC circuit with as many loops as time constants) is chosen with a carefully optimized 
time response. 
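
As a rough illustration of the thermal circuit just described, the sketch below treats the simplest case of a single time constant: an absorber of heat capacity C linked to the base plate through a thermal conductance G, so that C dT/dt = P - G (T - T_base) and the time constant is C/G. The function name and all numerical values are placeholders chosen for illustration and are not parameters of the Planck HFI.

```python
import math

def bolometer_step_response(power_w, heat_capacity_j_per_k, conductance_w_per_k, t_s):
    """Temperature rise above the base plate a time t_s after a constant
    optical power is switched on, for a single-pole thermal circuit."""
    tau = heat_capacity_j_per_k / conductance_w_per_k    # thermal time constant C/G
    rise_final = power_w / conductance_w_per_k           # steady-state rise P/G
    return rise_final * (1.0 - math.exp(-t_s / tau))

# Illustrative (assumed) values only.
C_ABS = 1.0e-12    # heat capacity of the absorber, J/K
G_LINK = 1.0e-10   # thermal conductance to the base plate, W/K
P_SKY = 1.0e-12    # absorbed optical power, W

tau_s = C_ABS / G_LINK                                   # time constant in seconds
rise_after_one_tau = bolometer_step_response(P_SKY, C_ABS, G_LINK, tau_s)
```

A real detector has several such loops in series and parallel, each contributing its own time constant, which is why the time response has to be characterized and deconvolved from the data.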

Several technologies are used for the thermistor. One of the most popular nowadays is 
transition edge sensors (TES) [118], where a superconducting thin film is kept at the edge
of the superconducting transition, so that minute changes in temperature give rise to large 
changes in resistivity. A feedback circuitry with heating is used to maintain the assembly 
near a fixed location on the normal-superconducting transition edge. One disadvantage of 
TES detectors is their low electrical impedance, which requires that SQUIDs (superconducting 
quantum interference devices), which need careful magnetic shielding, be used for the readout. 

Several promising new technologies are being developed and perfected for the next generation
of bolometric detectors. One such technology is kinetic inductance detectors (also known
as KIDs) [Til ITT] . For these detectors, a resonant electric circuit is constructed by depositing a 
pattern of superconducting film on a thick dielectric. Photons incident on the superconducting
film break Cooper pairs causing its surface impedance to vary with the incident photon flux,
which in turn alters the resonant frequency. The shifts in frequency of the resonator allow the
KID to be read out in a simple way without using SQUIDs. The surface impedance of the
superconducting film is largely determined by the inertia of the superconducting electrons,
hence the name “kinetic inductance.” 

Another possibly promising technology is the cold electron bolometer m, involving a 
small metal filament connected to a superconductor on both sides by means of a junction 
with a thin insulator in between. A bias current that passes through the assembly acts to cool 
the electrons in the thin wire through a Peltier-like effect to a temperature far below that of 
the substrate. CEBs are hoped to be more resistant to cosmic ray spikes than other detector 
technologies. 

Cosmic rays have proved to be more problematic than anticipated in the Planck experiment,
and substantial effort was needed to remove the contamination from cosmic ray events
in the raw time stream [185]. Some of the data was vetoed and corrections were applied to the
long-time tails of the larger magnitude events. While experiments from the ground benefit 
from substantial shielding of cosmic rays by the Earth’s atmosphere, how best to minimize 
interference from cosmic rays will be a major challenge for the next generation of experiments
from space, which target sensitivities more than two orders of magnitude beyond the
Planck HFI. Another challenge will be multiplexing, which is necessary to reduce the cooling 
requirements when the number of detectors is greatly expanded. 

Figure 21: Summary of T, E, and B anisotropy power spectra from scalar and tensor
modes. The green curves (from top to bottom) represent the TT, TE and EE CMB power
spectra for the “scalar” mode, while the blue curves represent the CMB anisotropies for the
tensor mode. The broken blue curves (from top to bottom) and the top solid blue curve
represent the TT, TE, EE and BB anisotropies assuming a value of the tensor-to-scalar ratio
of r = 0.1. The lower two solid curves represent the predicted BB anisotropy for r = 0.01 and
r = 0.001. The red curve shows the BB power spectrum arising nonlinearly from the scalar
mode as a result of gravitational lensing. (Credit: M. Bucher)


14.3 Special techniques for polarization 


If it were not for the fact that the polarized CMB anisotropy (i.e., the Q and U Stokes parameters)
is much smaller than the anisotropy in the I Stokes parameter, measuring polarization
would not pose any particular problems, because most detectors are polarization sensitive or
can easily be made so, and there would be no need to discuss any techniques particular to
polarization measurements. An inspection of the relative amplitudes of the spectra in Fig. 21
highlights the problems encountered. In particular if one wants to search for B modes at the
level of $r \approx 10^{-3}$ or better, the requirements for preventing leakages of various types become
exceedingly stringent, as we now describe.

Let us characterize the problem for the most difficult polarization observation: measuring 
the primordial B modes. We may classify the leakages to be avoided by ordering them from 
most to least severe: (i) leakage of the CMB temperature monopole to the B mode, (ii) leakage 
of the T anisotropies to the B mode, and (iii) leakage of the E mode to the B mode. 

One of the solutions to many of these problems is polarization modulation. The idea is
to place an element such as a rotating half-wave plate, whose orientation we shall denote by
the angle $\theta$, between the sky and the detector, preferably as the first element of the optical
chain, as sketched in Fig. 22. Half-wave plates can be constructed using anisotropic crystals
such as sapphire or using a mirror with a layer of wires a certain distance above it so that one
linear polarization reflects off the wires while the other reflects off the mirror, resulting in a
difference in path length. Mathematically, we have

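$$
\begin{pmatrix} E_x^{\rm out} \\ E_y^{\rm out} \end{pmatrix}
= \begin{pmatrix} +\cos\theta & -\sin\theta \\ +\sin\theta & +\cos\theta \end{pmatrix}
\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}
\begin{pmatrix} +\cos\theta & +\sin\theta \\ -\sin\theta & +\cos\theta \end{pmatrix}
\begin{pmatrix} E_x^{\rm in} \\ E_y^{\rm in} \end{pmatrix}
= \begin{pmatrix} +\cos 2\theta & +\sin 2\theta \\ +\sin 2\theta & -\cos 2\theta \end{pmatrix}
\begin{pmatrix} E_x^{\rm in} \\ E_y^{\rm in} \end{pmatrix},
\tag{88}
$$

so that in terms of the Stokes parameters, expressed below as the matrix-valued expectation
value $\langle E_i E_j^* \rangle$, we have

$$
\langle E_i E_j^* \rangle_{\rm out}
= \frac{1}{2}
\begin{pmatrix}
I + Q\cos 4\theta + U\sin 4\theta & Q\sin 4\theta - U\cos 4\theta + iV \\
Q\sin 4\theta - U\cos 4\theta - iV & I - Q\cos 4\theta - U\sin 4\theta
\end{pmatrix},
\tag{89}
$$

where $I$, $Q$, $U$, and $V$ are the Stokes parameters of the incoming radiation, defined through
$\langle E_i E_j^* \rangle_{\rm in} = \frac{1}{2}\begin{pmatrix} I + Q & U - iV \\ U + iV & I - Q \end{pmatrix}$.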


By measuring only the component that varies with an angular frequency $4\Omega$ (where $\Omega$ is the
angular velocity of the rotating plate), we can measure
the polarization without making two independent measurements of large numbers that are
then subtracted from each other. The much larger Stokes intensity I does not mix because it
remains constant in time.
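
The modulation argument can be checked numerically. The sketch below assumes an ideal half-wave plate described by the reflection-type Jones matrix with entries cos 2θ, sin 2θ, sin 2θ, -cos 2θ, a detector sensitive only to the x polarization behind the plate, and the sign convention $\langle E_x E_y^* \rangle = (U - iV)/2$. The function names and the test field are illustrative assumptions. Under these assumptions the detected power is $(I + Q\cos 4\theta + U\sin 4\theta)/2$, so projecting onto the fourth harmonic of the plate angle returns Q and U with no contribution from I.

```python
import math

def hwp_jones(theta):
    """Jones matrix of an ideal half-wave plate with fast axis at angle theta."""
    c2, s2 = math.cos(2 * theta), math.sin(2 * theta)
    return ((c2, s2), (s2, -c2))

def apply(m, ex, ey):
    """Apply a 2x2 Jones matrix to the field components (ex, ey)."""
    return m[0][0] * ex + m[0][1] * ey, m[1][0] * ex + m[1][1] * ey

def stokes(ex, ey):
    """Stokes (I, Q, U, V) of a fully polarized field, with the assumed
    convention <E_x E_y^*> = (U - iV)/2."""
    cross = ex * ey.conjugate()
    return (abs(ex) ** 2 + abs(ey) ** 2,
            abs(ex) ** 2 - abs(ey) ** 2,
            2 * cross.real,
            -2 * cross.imag)

def demodulate(ex, ey, n=360):
    """Recover (Q, U) from the x-polarized detector power by projecting onto
    cos(4 theta) and sin(4 theta) over one full turn of the plate."""
    q_est = u_est = 0.0
    for k in range(n):
        theta = 2 * math.pi * k / n
        out_x, _ = apply(hwp_jones(theta), ex, ey)
        power = abs(out_x) ** 2
        q_est += power * math.cos(4 * theta)
        u_est += power * math.sin(4 * theta)
    return 4 * q_est / n, 4 * u_est / n

ex_in, ey_in = 1.0 + 0.0j, 0.3 + 0.2j        # arbitrary test field
i_in, q_in, u_in, v_in = stokes(ex_in, ey_in)
q_rec, u_rec = demodulate(ex_in, ey_in)      # should reproduce q_in, u_in
```

Because the constant term proportional to I is orthogonal to the fourth harmonic, a slow drift in gain or offset affects the recovered Q and U only through its component near that harmonic, which is the practical benefit of modulation.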


To see how polarization modulation helps, let us suppose that the vector
$P_{\rm det}^{\rm (ideal)} = (I_{\rm det}^{\rm (ideal)}, Q_{\rm det}^{\rm (ideal)}, U_{\rm det}^{\rm (ideal)}, V_{\rm det}^{\rm (ideal)})$
is replaced by the vector
$P_{\rm det}^{\rm (actual)} = (I_{\rm det}^{\rm (actual)}, Q_{\rm det}^{\rm (actual)}, U_{\rm det}^{\rm (actual)}, V_{\rm det}^{\rm (actual)})$,
with $P_{\rm det}^{\rm (actual)} = (\mathbf{I} + \boldsymbol{\epsilon}) P_{\rm det}^{\rm (ideal)}$,
where $\mathbf{I}$ is the identity matrix and $\boldsymbol{\epsilon}$ represents a
hopefully small but unknown uncorrected residual error in the actual linear response of the
detector pair.

We discuss the setup for measuring polarization used in Planck as described above (see 
Fig. 23). We describe some of the possible systematic effects and how polarization modulation 
by means of a rotating half-wave plate would remove these systematic effects. Ideally the two 
orthogonal grids of wires would correspond, but for the direction of the linear polarization, to 
exactly the same circular beam on the sky and be perfectly calibrated, with no offset between 
the beam centers, nor any time-dependent drift of the differential gain. But in practice the 
beams are not identical and the gain and offset of the electronics drift in time. The centers 
of the beams do not coincide, and without corrections this would cause a local gradient of 
the T to be mistaken for E or B polarization. Differential ellipticity would cause the second
derivative of T to leak into E and B. Moreover, the actual beams are more complicated and
require three functions [i.e., maps of $I(\hat{\Omega})$, $Q(\hat{\Omega})$, and $U(\hat{\Omega})$] for a complete description. All the
above effects disappear, at least ideally, with the polarization modulation scheme described
above. Polarization modulation can resolve or mitigate a lot of problems but is not a panacea.
For example, polarization modulation cannot prevent $E \to B$ leakages. If there are any errors
in the calibration of the angles of the direction of the linear polarization in the sky, these errors
act to rotate E into B and vice versa. Moreover polarization modulation has been difficult
to realize in practice for a number of reasons such as spurious signals due to microphonic
coupling. Both continuously rotating and discretely stepped half-wave plates are options for
polarization modulation.



Figure 22: Polarization modulation by a rotating half-wave plate. We show a transmissive
rotating half-wave plate (HWP), here placed in front of a microwave feed horn pointing
directly at the sky. However for modern CMB experiments, the horn would generally be replaced
with a telescope having intermediate optical elements and horns on the focal plane.
The rotating HWP is birefringent, with its fast polarization axis represented by the line OO'.
As explained in the main text, by measuring only that part of the signal varying at an angular
frequency $4\Omega$ (where $\Omega$ is the angular velocity of the rotating HWP), one can prevent leakage
from the much larger Stokes parameter I into the linearly polarized components Q and U.
The presence of the HWP, while complicating the instrument design somewhat, allows many
hardware requirements to be relaxed substantially compared to what would be needed without
polarization modulation.

Figure 23: Planck polarization sensitive bolometers. Shown above is a photograph of
one of the Planck polarization sensitive bolometers consisting of two grids of parallel wires 
oriented orthogonally and placed on top of each other but separated by a small distance. Each 
grid has its own thermistor, allowing the two selected components of the linear polarizations 
to be read out separately. ( Credit: ESA/Planck Collaboration ) 

15 CMB Statistics Revisited: Dealing with Realistic 
Observations 

The analysis of a CMB experiment going from the raw time stream of data taken, to sky 
maps at a given frequency, then to a clean sky map containing only the CMB, and finally to 
a likelihood function, whose input is a complete theoretical power spectrum, involves many 
steps and details. Space does not permit a complete discussion. For the Planck experiment,
this analysis is described in the following papers of the Planck 2013 Results series and in the
references therein: overview of products and results [165], low frequency instrument data
processing [183], high frequency instrument data processing [184], component separation [166],
and HFI energetic particle effects [185]. Similar descriptions
can be found for other experiments. In this Section, we limit ourselves to sketching a few issues 
and to describing an idealized statistical analysis. 

One of the main steps in CMB analysis is constructing a likelihood for the data given 
a particular theoretical model. Relative likelihood lies at the heart of almost all statistical 
inference, whether one uses frequentist methods, Bayesian methods, or an ecumenical approach 
borrowing from both doctrines. Here we shall discuss only how to formulate the likelihood, as 
how to exploit the likelihood is less specific to the CMB and details can be found in standard 
treatises on modern statistics. (See for example Ref. [127] for a brief overview and Ref. [219]
for an authoritative treatment emphasizing the frequentist approach. For Bayesian sampling 
as applied to the CMB, see Refs. H23 and E3J and references therein.) 

Constructing a likelihood is straightforward for an idealized survey for which the noise
in the sky maps is Gaussian and isotropic. For the simplest likelihood function, the input
argument is a model for the CMB sky signal, or more precisely the parameters defining
this model. This theoretical model defines an isotropic Gaussian stochastic process whose
parameters one is trying to infer. In order to introduce some of the complications that arise,
we start by writing down the likelihood $\mathcal{L}$ for the data given the model, or rather the variable
$-2\ln[\mathcal{L}]$, which in some ways resembles $\chi^2$ in the Gaussian approximation. We have


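$$
-2\ln(\mathcal{L}) = \sum_{\ell=2}^{\infty} (2\ell+1)
\left\{ \ln\left( \frac{B_\ell C_\ell^{TT,{\rm (th)}} + N_\ell}{C_\ell^{TT,{\rm (sky)}} + N_\ell} \right)
+ \left( \frac{C_\ell^{TT,{\rm (sky)}} + N_\ell}{B_\ell C_\ell^{TT,{\rm (th)}} + N_\ell} \right) - 1 \right\}.
\tag{90}
$$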


Here the parameters $N_\ell$ represent the noise of the measurement. In the approximation of
white instrument noise (that is, with no unequal time correlations) we have $N_\ell = N_0$. The
factor $B_\ell$ represents the smearing of the sky signal due to the finite beam width. For the
beam profile approximated as having a Gaussian shape, $B_\ell = \exp[-\sigma^2 \ell^2]$ where the sky
intensity map is convolved with the Gaussian function $K(\theta) = (1/\sqrt{2\pi\sigma^2}) \exp[-\theta^2/(2\sigma^2)]$.¹¹
Note that we ignore constant offsets in the log likelihood because for most statistical analyses
only differences in the log likelihood are relevant. The presence of the normalization of the
Gaussian implies that a quadratic representation of the log likelihood is not exact and higher
order corrections are needed to avoid bias, especially at low $\ell$.
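
To make the structure of Eq. (90) concrete, the following sketch evaluates it mode by mode for a toy spectrum. It assumes the reading of Eq. (90) in which each multipole contributes $(2\ell+1)[\ln x + 1/x - 1]$ with $x = (B_\ell C_\ell^{\rm (th)} + N_\ell)/(C_\ell^{\rm (sky)} + N_\ell)$. The function names, the toy spectrum, the noise level, and the beam width are all illustrative assumptions.

```python
import math

def gaussian_beam_window(ell, sigma_rad):
    """Beam factor B_ell = exp(-sigma^2 ell^2) for a Gaussian beam of width sigma."""
    return math.exp(-(sigma_rad ** 2) * ell ** 2)

def minus_two_ln_like_tt(cl_th, cl_sky, noise, sigma_rad, ell_min=2):
    """Idealized full-sky TT value of -2 ln L, up to an additive constant.

    cl_th, cl_sky and noise are sequences indexed by ell; entries below
    ell_min are ignored."""
    total = 0.0
    for ell in range(ell_min, len(cl_th)):
        model = gaussian_beam_window(ell, sigma_rad) * cl_th[ell] + noise[ell]
        data = cl_sky[ell] + noise[ell]
        total += (2 * ell + 1) * (math.log(model / data) + data / model - 1.0)
    return total

# Toy inputs (assumed): a scale-invariant spectrum, white noise, small beam.
ELL_MAX = 50
SIGMA = 0.01
cl_toy = [0.0, 0.0] + [1.0 / (l * (l + 1)) for l in range(2, ELL_MAX + 1)]
noise_toy = [1.0e-4] * (ELL_MAX + 1)
cl_sky_toy = [gaussian_beam_window(l, SIGMA) * c for l, c in enumerate(cl_toy)]

at_truth = minus_two_ln_like_tt(cl_toy, cl_sky_toy, noise_toy, SIGMA)
off_truth = minus_two_ln_like_tt([1.2 * c for c in cl_toy], cl_sky_toy, noise_toy, SIGMA)
```

Each term $\ln x + 1/x - 1$ is non-negative and vanishes only at $x = 1$, so the expression is minimized when the beam-smoothed model matches the observed spectrum, and it is not symmetric about the minimum, which is the non-quadratic behavior mentioned above.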

To generalize the above expression to include polarization, we define the matrix 


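$$
\mathbf{C}_\ell =
\begin{pmatrix}
C_\ell^{TT} + B_\ell^{-1} N_\ell^{TT} & C_\ell^{TE} \\
C_\ell^{TE} & C_\ell^{EE} + B_\ell^{-1} N_\ell^{EE}
\end{pmatrix}
\tag{91}
$$

so that

$$
-2\ln(\mathcal{L}) = \sum_{\ell=2}^{\infty} (2\ell+1)
\left\{ \ln\det\left[ \mathbf{C}_\ell^{\rm (th)} \left(\mathbf{C}_\ell^{\rm (obs)}\right)^{-1} \right]
+ {\rm tr}\left[ \mathbf{C}_\ell^{\rm (obs)} \left(\mathbf{C}_\ell^{\rm (th)}\right)^{-1} \right] - 2 \right\}.
\tag{92}
$$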
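
A minimal sketch of the polarized generalization follows, assuming that the logarithm in Eq. (92) acts on the determinant of the 2x2 matrix product and that the noise terms have already been folded into the matrices as in Eq. (91). The helper names and the toy covariance matrices are hypothetical.

```python
import math

def det2(m):
    """Determinant of a 2x2 matrix given as nested tuples."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def inv2(m):
    """Inverse of a 2x2 matrix given as nested tuples."""
    d = det2(m)
    return ((m[1][1] / d, -m[0][1] / d), (-m[1][0] / d, m[0][0] / d))

def trace_of_product(a, b):
    """tr(a b) for 2x2 matrices."""
    return sum(a[i][j] * b[j][i] for i in range(2) for j in range(2))

def minus_two_ln_like_te(c_th, c_obs, ell_min=2):
    """Idealized full-sky (T, E) value of -2 ln L, up to an additive constant.

    c_th[ell] and c_obs[ell] are 2x2 covariance matrices per multipole."""
    total = 0.0
    for ell in range(ell_min, len(c_th)):
        ln_det_ratio = math.log(det2(c_th[ell]) / det2(c_obs[ell]))
        tr_term = trace_of_product(c_obs[ell], inv2(c_th[ell]))
        total += (2 * ell + 1) * (ln_det_ratio + tr_term - 2.0)
    return total

def toy_matrix(ell, amp=1.0):
    """Hypothetical (T, E) covariance with a fixed TE correlation."""
    s = amp / (ell * (ell + 1)) if ell >= 2 else 1.0
    return ((1.0 * s, 0.3 * s), (0.3 * s, 0.5 * s))

c_obs_toy = [toy_matrix(l) for l in range(31)]
te_at_truth = minus_two_ln_like_te(c_obs_toy, c_obs_toy)
te_off_truth = minus_two_ln_like_te([toy_matrix(l, 1.3) for l in range(31)], c_obs_toy)
```

The constant 2 is the trace of the 2x2 identity, so the expression vanishes when theory and observation coincide, just as each term of the temperature-only expression does.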

In order to generalize to less idealized (i.e., more realistic) situations, it is useful to rewrite
the above results in a more abstract matrix notation, in which the expressions can be interpreted
simultaneously as expressions in real (i.e., angular) space and as expressions in
harmonic space.

$$
-2\ln[\mathcal{L}] = \ln\det(\mathbf{C}_{\rm th} + \mathbf{N}) + \mathbf{t}^T (\mathbf{C}_{\rm th} + \mathbf{N})^{-1} \mathbf{t}.
\tag{93}
$$


Here the vector $\mathbf{t}$ represents the observed sky map, $\mathbf{C}_{\rm th}$ is the covariance matrix of the underlying
sky signal (in general depending on a number of parameters whose values one is trying
to infer), and $\mathbf{N}$ represents the covariance matrix of the instrument (and other) noise. In
the idealized case where everything is isotropic, this is simply a rewriting of Eq. (90), but in
less idealized cases the likelihood in Eq. (93) is correct whereas Eqs. (90) and (92) are not
applicable to the more general case.
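
For a small number of pixels the Gaussian likelihood of Eq. (93) can be evaluated directly. The sketch below assumes the standard form ln det(C + N) + t^T (C + N)^{-1} t and uses a hand-written Cholesky factorization so that it depends only on the standard library. The function names and the two-pixel example are illustrative assumptions.

```python
import math

def cholesky(a):
    """Lower-triangular factor L of a symmetric positive-definite matrix a = L L^T."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = math.sqrt(s) if i == j else s / low[j][j]
    return low

def minus_two_ln_like_pixel(t, c_th, n_cov):
    """ln det(C + N) + t^T (C + N)^{-1} t for a map t in pixel space."""
    n = len(t)
    cov = [[c_th[i][j] + n_cov[i][j] for j in range(n)] for i in range(n)]
    low = cholesky(cov)
    ln_det = 2.0 * sum(math.log(low[i][i]) for i in range(n))
    # Forward substitution solves L y = t, so that t^T cov^{-1} t = y^T y.
    y = []
    for i in range(n):
        y.append((t[i] - sum(low[i][k] * y[k] for k in range(i))) / low[i][i])
    return ln_det + sum(v * v for v in y)

# Two-pixel toy example (assumed values).
signal_cov = [[1.0, 0.5], [0.5, 1.0]]
noise_cov = [[0.1, 0.0], [0.0, 0.1]]
sky_map = [0.3, -0.2]
value = minus_two_ln_like_pixel(sky_map, signal_cov, noise_cov)
```

Masking part of the sky amounts to deleting the corresponding rows and columns before factorizing, which is why the pixel-space form remains valid for a cut sky. The factorization costs of order the cube of the number of pixels, which limits this direct approach to low resolution.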

Let us now consider the effect of partial sky coverage, which may result from a survey
that does not cover the whole sky or from masking portions of the sky such as the galaxy and
bright point sources where contamination of the primordial signal cannot be corrected for in a
reliable way. If the vectors and matrices in Eq. (93) are understood as representations in pixel
space, at least formally there is no difficulty in evaluating this expression in the subspace
of pixel space representing the truncated sky. If the dimensionality of the pixelized maps
were small, there would be no difficulty in simply evaluating this expression by brute force.
Another complication arises from inhomogeneous noise. Most surveys do not cover the sky
uniformly because requirements such as avoiding the sun, the earth, and the planets as well
as instrumental considerations lead to nonuniform sky coverage. This leads to a noise matrix
that has a simple representation in pixel space (under the assumption of white noise), but not
in harmonic space.

¹¹ Note that in the literature, beam widths are generally specified in terms of FWHM rather than using the
Gaussian definition above.

Unfortunately evaluating likelihoods such as in Eq. (93) exactly is not possible for full-sky
surveys at high angular resolution such as the Planck survey for which pixelized maps include
up to about $N_{\rm pix} = 5 \times 10^7$ pixels. While $\mathbf{C}_{\rm th}$ is simple (i.e., sparse) in harmonic space,
it is dense in pixel space, and the opposite holds for $\mathbf{N}$ (ignoring correlated noise), so no
representation can be found to simplify the calculation in which all the matrices are sparse.
Since even enumerating the matrices involves $O(N_{\rm pix}^2)$ operations, and inverting these matrices
or taking their determinant requires $O(N_{\rm pix}^3)$ operations, it is immediately apparent that
brute force will not work, and more clever, approximate techniques are required. Pixel based
likelihoods are however used at low $\ell$, where other approximations have difficulty representing
the likelihood, in large part because the cosmic variance is non-Gaussian.
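
The scaling argument can be turned into rough numbers. The short calculation below assumes 8-byte matrix entries and a hypothetical machine sustaining $10^{15}$ floating-point operations per second; both figures are illustrative assumptions, and constant factors in the operation count are ignored.

```python
N_PIX = 5.0e7                  # number of pixels quoted in the text
BYTES_PER_ENTRY = 8            # assumed double-precision storage
FLOPS_ASSUMED = 1.0e15         # hypothetical sustained machine speed

dense_matrix_bytes = BYTES_PER_ENTRY * N_PIX ** 2      # storage for one dense matrix
dense_matrix_petabytes = dense_matrix_bytes / 1.0e15
cubic_operations = N_PIX ** 3                          # order of magnitude for inversion
seconds_needed = cubic_operations / FLOPS_ASSUMED
years_needed = seconds_needed / (365.25 * 24 * 3600)
```

With these assumptions a single dense covariance matrix occupies about 20 petabytes and one factorization takes of order years, and a likelihood analysis requires many such evaluations, one for each point sampled in parameter space.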

While the likelihoods above assumed an isotropic Gaussian stochastic process for the underlying
theoretical model generating the anisotropy pattern in the sky, these likelihoods can
be generalized to include nonlinear effects, such as non-Gaussianity through $f_{NL}$, gravitational
lensing, and certain models of weak statistical anisotropy, to name just a few examples. Using 
the likelihood as a starting point, possibly subject to some approximations, has the virtue that 
one is guaranteed that an optimal statistical analysis will result without any special ingenuity. 

In the above simplified discussion, we have not touched on a number of very important 
issues, of which we list just a few examples: asymmetric beams, far side-lobe corrections, bandpass
mismatch, errors from component separation, estimating noise from the data, and null
tests (also known as “jackknives”). The reader is referred to the references at the beginning 
of this Section for a more complete discussion and references to relevant papers. 


16 Galactic Synchrotron Emission 

An inspection of the single frequency temperature maps from the Planck space mission (see 
Fig. 0, shown here in galactic coordinates so that the equator corresponds to the galactic
plane, suggests that the 70 GHz frequency map is the cleanest, as the excess emission
around the galactic plane is the narrowest at this frequency. But at lower frequencies, the
region where the galactic contamination dominates over the primordial CMB temperature
anisotropies widens. This contamination at low frequencies is primarily due to galactic synchrotron
emission, resulting from ultrarelativistic electrons spiralling in the galactic magnetic
field, whose strength is of order a few microgauss. (See for example Ref. [78].) This emission
is generally described as nonthermal because the energy spectrum of charged particles is
non-Maxwellian, especially at high energies. The electrons in question reside in the high energy
tail of the electron energy spectrum, described empirically as a falling power law spectrum. 
Models of cosmic ray acceleration, for example arising about a shock from an expanding 
supernova remnant, are able to explain spectra having such a power law form [20]. If the 
radiating charged particles were nonrelativistic, their radiation would be emitted at the cyclotron
frequency $\omega_c = eB/mc$, with very little emission in higher harmonics, and given the
measured values of the galactic magnetic field, one could not thus explain the observed high 
frequency emission. However, if the charged particles are highly relativistic, due to beaming 
effects the bulk of the radiation is emitted in the very high order harmonics, allowing the 
observed emission to be explained for reasonable values of the galactic magnetic field EHB- 
Although the data concerning the three-dimensional structure of the galactic magnetic field is 
at present in a very rudimentary state, and little is known observationally about the cosmic 
ray spectrum outside our immediate solar neighborhood, the physical mechanism underlying 
galactic synchrotron emission is well understood physics. One important point is that no 
matter what the details of the cosmic ray energy spectrum may be, the kernel of the integral 
transform relating this distribution to the frequency spectrum of the observed synchrotron 
emission applies considerable smoothing. Thus we can assume with a high degree of confidence
that the synchrotron spectrum is smooth in frequency. Theory also predicts that this
synchrotron emission will be highly polarized, at least to the extent that the component of the
galactic magnetic field coherent over large scales is not negligible. This expectation is borne
out by observations. It is customary to describe the synchrotron spectrum using a power law, 
so that its R-J brightness temperature may be fit by an Ansatz of the form 

$$
T_{\rm R\text{-}J}(\hat{\Omega}, \nu) = T_{\rm R\text{-}J}(\hat{\Omega}, \nu_0) \left( \frac{\nu}{\nu_0} \right)^{-\alpha},
\tag{94}
$$

where empirically it has been established that $\alpha \approx 2.7$-$3.1$. The same holds for the polarization,
that is, the Stokes Q and U (and also V) parameters. In the ultrarelativistic approximation
this emission is primarily linearly polarized, but there is also a smaller circularly polarized
component suppressed by a factor of $1/\gamma$. Variations in the spectral index depending on position
in the sky have been observed [58].
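
The sensitivity of the extrapolation to the spectral index can be illustrated with a short calculation. The sketch assumes the power law $T(\nu) = T(\nu_0)(\nu/\nu_0)^{-\alpha}$ and extrapolates from 408 MHz to 30 GHz; the reference brightness temperature of 20 K is a placeholder, not a measured value.

```python
def synchrotron_brightness(t_ref_k, nu_ghz, nu_ref_ghz, alpha):
    """Power-law R-J brightness temperature T(nu) = T(nu_ref) (nu/nu_ref)^(-alpha)."""
    return t_ref_k * (nu_ghz / nu_ref_ghz) ** (-alpha)

T_REF_K = 20.0        # assumed brightness temperature at the reference frequency
NU_REF_GHZ = 0.408    # low-frequency template
NU_TARGET_GHZ = 30.0  # a typical low CMB channel

shallow = synchrotron_brightness(T_REF_K, NU_TARGET_GHZ, NU_REF_GHZ, 2.7)
steep = synchrotron_brightness(T_REF_K, NU_TARGET_GHZ, NU_REF_GHZ, 3.1)
spread = shallow / steep   # equals (nu/nu_ref)^0.4, independent of T_REF_K
```

Across the quoted range of spectral index the predicted 30 GHz brightness differs by a factor between five and six, which is why a long lever arm in frequency and a spatially varying index both matter for synchrotron removal.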

A description of the WMAP full-sky observations of synchrotron emission appears in the 
first-year WMAP foregrounds paper [238] and the polarization observations are described
in the three-year WMAP paper on foreground polarization [244]. A greater lever arm, in
particular for studying the synchrotron spectral index, can be obtained by including the 
408 MHz Haslam map covering almost the entire sky [SI] . 


17 Free-Free Emission 

Another source of low-frequency galactic contamination is free-free emission arising from
Bremsstrahlung photons emitted in electron-ion collisions in the ionized interstellar medium,
with electron-electron collisions contributing to a much lesser extent. Like galactic synchrotron emission, the
free-free emission brightness temperature falls with increasing frequency, but the fall-off is 
slower than that of the galactic synchrotron emission. 

Unlike synchrotron emission, free-free emission is not polarized. H$\alpha$ ($n = 3 \to n = 2$)
emission resulting from recombination of ionized atomic hydrogen can be used as a template
for removing the free-free component, because the H$\alpha$ emission likewise is proportional to
the square of the electron density. However, because $\Gamma_{{\rm H}\alpha}(T)$ and $\Gamma_{\rm free\text{-}free}(T, \nu)$ do not
have an identical temperature dependence, free-free removal using an H$\alpha$ template has some
intrinsic error. $\Gamma_{{\rm H}\alpha}(T)$ is defined as the constant of proportionality in the emissivity relation
$\epsilon_{{\rm H}\alpha} = \Gamma_{{\rm H}\alpha}(T)\, n_e^2$, where $\epsilon_{{\rm H}\alpha}$ is a bolometric emissivity (power per unit volume) and $n_e$ is
the density of free electrons. Likewise $\epsilon_{\rm free\text{-}free}(\nu) = \Gamma_{\rm free\text{-}free}(T, \nu)\, n_e^2$, where the emissivity has
units of power per unit frequency per unit volume per unit solid angle.


18 Thermal Dust Emission 


In Sec. 16 we discussed galactic synchrotron emission, whose R-J brightness temperature rises
with decreasing frequency, making it exceedingly difficult to measure CMB anisotropies at
frequencies below $\sim 20$ GHz, especially close to the galactic plane where the galactic synchrotron
emission is most intense. We also saw how the fact that the brightness temperature
increases with decreasing frequency can be understood as a consequence of the synchrotron
optical depth being much smaller than unity and increasing with decreasing frequency. Eventually,
when the frequency is low enough so that the optical depth is near one, the brightness
temperature stops rising and approaches a temperature related to the cosmic ray electron 
kinetic energy. 

This Section considers another component contributing to the microwave sky: the thermal
emission from interstellar dust, whose brightness temperature behaves in the opposite
way, that is, the brightness temperature increases with increasing frequency. This behavior
results because at microwave frequencies, the wavelength greatly exceeds the typical size of
a dust grain. At long wavelengths, the cross-section for absorbing or elastically scattering
electromagnetic radiation is much smaller than the geometric cross-section $\sigma_{\rm geom} \approx a^2$ where
$a$ is the effective grain radius. The dust grains can be approximated as dipole radiators in
this frequency range because higher order multipoles are irrelevant. The dependence of the
cross-section on frequency can be qualitatively understood by modeling the electric dipole
moment of the grain as a damped harmonic oscillator where the resonant frequency is much
higher than the frequencies of interest. In this low frequency approximation

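$$
\sigma_{\rm abs} \sim \left( \frac{a}{\lambda} \right)^2 a^2
\tag{95}
$$

for absorption, and for the elastic, or Rayleigh, scattering component, which becomes subdominant
at very low frequencies,

$$
\sigma_{\rm elastic} \sim \left( \frac{a}{\lambda} \right)^4 a^2.
\tag{96}
$$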

This is a simplistic approximation that we later shall see is not accurate, but it does provide 
intuition about the qualitative behavior expected. 

Our knowledge of interstellar dust derives from combining different observations spanning 
a broad range of frequencies, from the radio to the UV and even X-ray bands, in order to put 
together a consistent theoretical model able to account simultaneously for all the observations. 
(For a nice recent overview, see for example Refs. [4E] and P2J.) In this Section, we restrict 
ourselves to discussing the thermal microwave emission properties of the dust. 

An empirical law commonly used to model thermal dust emission at low frequencies is a
Planck blackbody spectrum modulated by a power law emissivity, or graybody factor,

$$
I_{\rm dust}(\nu) \propto \nu^{\beta}\, B(\nu, T_{\rm dust}),
\tag{97}
$$

where the Planck spectrum is given by

$$
B(\nu, T) = \frac{2 h \nu^3}{c^2}\, \frac{1}{\exp(h\nu / k_B T) - 1}.
\tag{98}
$$
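
The graybody law is straightforward to evaluate numerically. The sketch below assumes the form $(\nu/\nu_0)^{\beta} B(\nu, T_{\rm dust})$, with the pivot frequency $\nu_0 = 353$ GHz, the dust temperature of 20 K, and the index $\beta = 1.5$ all chosen as placeholders for illustration.

```python
import math

H_PLANCK = 6.62607015e-34    # J s
K_BOLTZ = 1.380649e-23       # J / K
C_LIGHT = 2.99792458e8       # m / s

def planck(nu_hz, t_k):
    """Planck spectral radiance B(nu, T) in W m^-2 Hz^-1 sr^-1."""
    x = H_PLANCK * nu_hz / (K_BOLTZ * t_k)
    return 2.0 * H_PLANCK * nu_hz ** 3 / C_LIGHT ** 2 / math.expm1(x)

def graybody(nu_hz, t_dust_k, beta, nu0_hz=353.0e9):
    """Modified blackbody (nu/nu0)^beta B(nu, T_dust), normalized at nu0."""
    return (nu_hz / nu0_hz) ** beta * planck(nu_hz, t_dust_k)

T_DUST_K = 20.0    # assumed dust temperature
BETA = 1.5         # assumed emissivity index

# Ratio of dust intensity between two example frequencies, 353 and 143 GHz.
ratio_353_to_143 = graybody(353.0e9, T_DUST_K, BETA) / graybody(143.0e9, T_DUST_K, BETA)
```

In the Rayleigh-Jeans regime the Planck function scales as $\nu^2$, so the graybody intensity rises roughly as $\nu^{2+\beta}$, which is the steep rise toward high frequency that makes dust the dominant foreground there.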


The physical basis for this modification to the Planck spectrum can be partially motivated
by the following argument based on the linear electric susceptibility of the dust grain, following
a line of reasoning first suggested by E. M. Purcell in 1969 [192]. The simple argument, subject
to a few caveats, suggests that $\beta = 2$. However the observational data do not bear out
this prediction and instead are better fit by a power law with $\beta \approx 1.4$-$1.6$ [137, 162].
Let $\chi(\omega)$ represent the linear susceptibility of a dust grain, which as a consequence of the
Kramers-Kronig dispersion relations must include both nonvanishing real and imaginary parts.
Here $\mathbf{d}(\omega) = \chi(\omega)\mathbf{E}(\omega)$, where $\mathbf{d}(\omega)$ and $\mathbf{E}(\omega)$ are the grain electric dipole moment and
the surrounding electric field, respectively. As a consequence of causality, $\chi(\omega)$ is analytic
on the lower half-plane. Its poles in the upper half-plane represent those decaying mode
excitations of the grain that are coupled to the electromagnetic field. Moreover, on the real
axis $\chi(-\omega) = [\chi(+\omega)]^*$. If we assume analyticity in a neighborhood of the origin, we may
expand as a power series about the origin, so that

$$
\chi(\omega) = \chi_0 + i\chi_1\omega + \chi_2\omega^2 + \cdots,
\tag{99}
$$

and $\chi_0, \chi_1, \chi_2, \ldots$ are all real. It follows that the energy dissipated is characterized by the
absorptive cross-section

$$
\sigma_{\rm abs}(\omega) = \frac{4\pi\omega\, {\rm Im}[\chi(\omega)]}{c},
\tag{100}
$$

and at low frequencies

$$
\sigma_{\rm abs}(\omega) \approx \left( \frac{4\pi\chi_1}{c} \right) \omega^2,
\tag{101}
$$

which corresponds to an emissivity index $\beta = 2$ as $\omega \to 0$. The perhaps questionable assumption
here is that there exists a circle of nonzero radius about the origin of the $\omega$-plane where
$\chi(\omega)$ has no singularities. If we postulate that there is an infinite number of poles of suitably
decreasing strength whose accumulation point is $\omega = 0$, we may evade the conclusion that
asymptotically as $\omega \to 0$, $\beta \to 2$. For an insightful discussion of how such low-frequency poles
could arise, see C. Meny et al.
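
The low-frequency limit can be checked on the simplest analytic example, a single damped harmonic oscillator. The sketch below assumes $\chi(\omega) = \chi_0\omega_0^2/(\omega_0^2 - \omega^2 - i\gamma\omega)$, with the sign of the damping term chosen so that ${\rm Im}\,\chi > 0$ for $\omega > 0$, and works in arbitrary units with $c = 1$. The parameter values and function names are illustrative assumptions.

```python
import math

def chi(omega, chi0=1.0, omega0=1.0, gamma=0.1):
    """Susceptibility of a single damped oscillator (assumed sign convention)."""
    return chi0 * omega0 ** 2 / complex(omega0 ** 2 - omega ** 2, -gamma * omega)

def sigma_abs(omega, c=1.0):
    """Absorption cross-section 4 pi omega Im[chi(omega)] / c."""
    return 4.0 * math.pi * omega * chi(omega).imag / c

def log_slope(f, x, factor=2.0):
    """Logarithmic slope d ln f / d ln x estimated between x and factor * x."""
    return math.log(f(factor * x) / f(x)) / math.log(factor)

beta_low = log_slope(sigma_abs, 1.0e-3)       # well below resonance
chi1_estimate = chi(1.0e-4).imag / 1.0e-4     # coefficient of the linear term in Im chi
```

Well below resonance the slope approaches 2 and the linear coefficient approaches $\chi_0\gamma/\omega_0^2$. A finite sum of such oscillators behaves in the same way, which is why evading $\beta = 2$ requires poles accumulating at the origin.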


19 Dust Polarization and Grain Alignment 

The fact that interstellar dust grains are aligned was first seen through the polarization of
starlight at optical frequencies in 1949 [77, 87, 88]. Since this discovery an abundant literature
has been amassed elucidating numerous aspects of dust grain dynamics. The best bet solution
to the alignment problem has evolved over the years as additional relevant physical effects
were pointed out. It is unclear whether the current understanding will remain the last word
on this subject. The most naive solution to the alignment problem would be some sort of
compass needle type mechanism with some dissipation. However, it was discovered early on
that in view of the plausible magnitudes for the galactic magnetic field, such an explanation
cannot work [214]. The theoretical explanation that has evolved involves detailed modeling
of the rotational dynamics of the grains, which as we shall see below almost certainly rotate
suprathermally.

For microwave observations of the primordial polarization of the CMB at ever greater 
sensitivity, the removal of contamination from polarized thermal dust emission is of the utmost 
importance. This fact is highlighted by the discussion of how to interpret the recent BICEP2 
claim (discussed in Sec. 11.1) of a detection of a primordial B mode signal based on essentially 
a single frequency channel. This interpretation however seems unlikely given subsequent
Planck estimates of the likely dust contribution. The problem is that little data is
presently available on polarized dust emission at microwave frequencies. Current models of
polarized dust involve a large degree of extrapolation and are in large part guided by simplicity. 
From a theoretical point of view, polarized dust could be much more complicated than the 
simplest empirical modeling suggests. For example there is no a priori reason to believe 
that polarized emission from dust must follow the same frequency spectrum as its emission 
in intensity. It could well be that the weighting over grain types gives a different frequency 
dependence, or that as more precise dust modeling is called for, the need to include several 
species of dust with spatially nonuniform distributions will become apparent. Moreover, based 
on theoretical considerations, it is not clear that there should be a simple proportionality 
between polarization and the strength of the magnetic field. If the BICEP2 B mode signal
is confirmed as a dust artefact, we could see ourselves in the coming years trying to detect
values of the tensor-to-scalar ratio around $r \approx 10^{-3}$, which would require exceedingly accurate
polarized dust removal from the individual frequency maps, and before this happens a wealth
of new observational data on polarized dust emission will become available. If, on the other
hand, a detection of B modes at a large value of r is made, as one tries to map out the precise
primordial B mode spectrum, for example to measure $n_T$ and thus check the consistency
between $\mathcal{P}_S(k)$ and $\mathcal{P}_T(k)$, the need for accurate dust removal will likewise be great, but for
slightly different reasons.

Since the observational inputs are likely to change, here we particularly emphasize the 
underlying physics of the alignment, which may be updated, but is likely to change less. The 
theoretical issues of dust alignment have so far not been widely discussed among those not 
specialized in the ISM (interstellar medium). Nevertheless, we believe that the questions 
raised will become increasingly relevant for CMB observations in the future, and this is why 
a thorough discussion is presented here. 

We shall find that the alignment of interstellar dust grains is closely linked to the rotational 
dynamics of a dust grain. We therefore start with a study of the rotation of a dust grain. 
Mathematically, the simplest case to analyze is a prolate (needle-like) or oblate (flattened 
spherical) dust grain with an axis of azimuthal symmetry, which constitutes a symmetric 
top. In this case, the motions can be solved in terms of elementary functions. For the more 
general case of an asymmetric grain (where none of the three principal values of the moment 
of inertia tensor are equal) integration of the equations of motion in the absence of torques is 
more difficult, but can be achieved using Jacobian elliptic functions.


In the noninertial frame rotating with the dust grain, which is convenient because in this 
frame the moment of inertia tensor is constant in time, two types of inertial forces arise: 

(i) a centrifugal force, described by the potential $V_{\rm ctr}(\mathbf{x})/m = -\frac{1}{2}\left[\Omega^2(t)\,\mathbf{x}^2 - (\boldsymbol{\Omega}(t)\cdot\mathbf{x})^2\right]$, and

(ii) a Coriolis force $\mathbf{F}_{\rm cor}/m = -2\,\boldsymbol{\Omega}(t)\times\mathbf{v}$, which for our purposes is unimportant because
the velocities are negligible in the co-rotating frame. In order to understand the alignment
process, we must model the solid as being elastic with dissipation included.

19.1 Why do dust grains spin? 

A flippant answer to the above question might be: Why not? If we can associate a temperature 
with the three kinetic degrees of freedom of the dust grain, we would conclude that 

$$\frac{1}{2} I \omega^2 \sim \frac{1}{2} k_B T_{\rm rot}. \qquad (102)$$

Early discussions of rotating dust grains supposed that the rotational degrees of freedom are 
subject to random torques, so that 


$$\langle \boldsymbol{\Gamma} \rangle = 0. \qquad (103)$$

Later it was realized that suprathermal rotation is plausible, where the implied temperature of 
the rotation can greatly exceed the effective temperature of any of the other relevant degrees 
of freedom. 

The question now arises what temperature one should use for $T_{\rm rot}$. A natural candidate
might be the kinetic temperature of the ambient gas $T_{\rm kin}$. Indeed one expects molecules of the
ambient gas to collide with the dust grains, and such collisions do provide random torques, 
causing the angular momentum to undergo a random walk accompanied by a dissipation 
mechanism, as must be present to prevent the angular velocity from diverging with time. 
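
To get a feeling for the numbers implied by Eq. (102), the following minimal sketch evaluates the thermal rotation rate for an idealized grain. The uniform-sphere moment of inertia, the radius of 0.1 micron, the density of 3 g/cm^3 and the choice $T_{\rm rot} = 100$ K are illustrative assumptions, not values taken from the text.

```python
import math

K_B = 1.380649e-16  # Boltzmann constant in erg/K (cgs units)


def thermal_rotation_rate(radius_cm, density_g_cm3, temperature_k):
    """Order-of-magnitude angular velocity from (1/2) I w^2 ~ (1/2) k_B T.

    Assumption: the grain is a uniform sphere, so I = (2/5) M a^2.
    """
    mass = (4.0 / 3.0) * math.pi * density_g_cm3 * radius_cm**3
    inertia = 0.4 * mass * radius_cm**2
    return math.sqrt(K_B * temperature_k / inertia)


if __name__ == "__main__":
    # Hypothetical grain: a = 0.1 micron, rho = 3 g/cm^3, T_rot = 100 K.
    omega = thermal_rotation_rate(1.0e-5, 3.0, 100.0)
    print(f"omega ~ {omega:.2e} rad/s")
```

For these assumed parameters the rate is of order $10^5$ rad/s, and it scales as $a^{-5/2}$, so smaller grains rotate much faster.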

19.2 About which axis do dust grains spin? 

This question has two parts. First, we may ask what is the alignment of the co-rotating 
internal (body) coordinates with respect to the angular momentum vector. Second, we may 
ask how the angular momentum aligns itself with respect to the inertial coordinates of the 
ambient space. In this Section, we consider only the first question, postponing the second 
question to a later section. 

But for exceptional situations where two or more eigenvalues of the moment of inertia
tensor coincide, the principal axes of the moment of inertia tensor provide a natural set of axes
for the dust grain, and we may order the eigenvalues so that $I_1 < I_2 < I_3$. We argue that the
dust grain tends to align itself so that its $I_3$ axis is parallel to the angular momentum. This
orientation of the grain minimizes the rotational energy subject to the constraint that the total
angular momentum remains constant. If the grain starts in a state of random orientation, the
grain wobbles. While $\mathbf{L}$ remains constant, $\boldsymbol{\Omega}(t) = I^{-1}(t)\mathbf{L}$ (expressed in inertial coordinates)
does not, but rather fluctuates, exciting internal vibrational modes that dissipate. As the
excess rotational energy is dissipated, an alignment as described above is achieved.* The
relevant question is how this time scale compares to other time scales of the grain rotation
dynamics. It turns out that this time scale, which we denote $\tau_{I-L}$, is short. In the equations
that follow, we exploit the shortness of $\tau_{I-L}$ compared to the other relevant time scales. This
hierarchy of time scales allows us to simplify the equations by excluding the degrees of freedom
that describe the lack of alignment of the principal axis of maximum moment of inertia from
$\mathbf{L}$, leading to a considerable simplification of the formalism.

* Much to the dismay of NASA engineers, this is exactly what happened to the Explorer 1 rocket, the
first US satellite launched in 1958 following Sputnik, due primarily to dissipation from its whisker antennas.
The elongated streamlined cylinder was initially set in rotation about its long axis, but subsequently, after
an initial wobble phase, ended up spinning about an axis at right angles in the body coordinates. Since then
rotational stability, also relevant in ballistics, has become a carefully considered design issue for spacecraft.
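
The energy argument can be made concrete with a small sketch. It assumes a symmetric top (two equal moments, here called `I_perp`, and a third moment `I_3`), which is a simplification of the general asymmetric grain; the function name and the numbers are purely illustrative.

```python
import math


def rotational_energy(L, theta, I_perp, I_3):
    """Kinetic energy of a torque-free symmetric top at fixed |L|.

    theta is the angle between the angular momentum and the body axis
    whose moment of inertia is I_3; the other two moments equal I_perp.
    """
    L_parallel = L * math.cos(theta)   # component along the I_3 axis
    L_perp = L * math.sin(theta)       # component in the transverse plane
    return 0.5 * (L_parallel**2 / I_3 + L_perp**2 / I_perp)


if __name__ == "__main__":
    # Hypothetical oblate grain with I_3 twice as large as I_perp.
    for degrees in (0, 30, 60, 90):
        E = rotational_energy(1.0, math.radians(degrees), 1.0, 2.0)
        print(degrees, round(E, 4))
```

With $I_3 > I_\perp$ the energy at fixed $|\mathbf{L}|$ is lowest at $\theta = 0$, so any internal dissipation that conserves $\mathbf{L}$ drives the grain toward rotation about the axis of maximum moment of inertia.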


19.3 A stochastic differential equation for L(t)

Mathematically, we may describe the combined effects of the torque from random collisions 
and its associated damping through the coherent torque using a stochastic differential equation 
of the form 

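$$\dot{\mathbf{L}}(t) = -\alpha\left(\mathbf{L}(t) - \bar{L}\,\frac{\mathbf{L}(t)}{|\mathbf{L}(t)|}\right) + \boldsymbol{\Gamma}_{\rm ran}(t). \qquad (104)$$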

In this equation, the torque described by the first term is coherent with respect to the body 
coordinates of the dust grain. Later we shall also consider torques that are coherent with 
respect to the ambient inertial coordinates, which must be modeled in a different way. Here 
the stochastic Gaussian forcing term has the expectation values 


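$$\langle \Gamma_{{\rm ran},i}(t) \rangle = 0, \qquad \langle \Gamma_{{\rm ran},i}(t)\,\Gamma_{{\rm ran},j}(t') \rangle = \mu_L\,\delta_{ij}\,\delta(t - t'), \qquad (105)$$

where the angular momentum diffusion coefficient has units $[\mu_L]$ = (Torque)$^2$ (Time), or equivalently (Angular momentum)$^2$/(Time).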

Let us analyze the stationary ensemble defined by the above equation—in other words, the 
probability function $p(\mathbf{L})$ toward which the biased random walk defined by the above equation
evolves after a long period of time. In the low-temperature limit, $|\mathbf{L}| \approx \bar{L}$, with negligible
fractional fluctuations, and the direction of the angular momentum vector $\hat{\mathbf{n}}_L = \mathbf{L}(t)/|\mathbf{L}(t)|$
follows a random walk on the surface of the sphere with diffusion coefficient $\mu_n = \mu_L/\bar{L}^2$.
In the opposite high temperature limit, the $\bar{L}$ term is a sideshow or small correction, and
the main effect is the competition between the first and second terms leading to a thermal
ensemble, which for an isotropic moment of inertia tensor would take the form



$$p(\mathbf{L}) \propto \exp[-\beta L^2], \qquad (106)$$


where $\beta = \alpha/\mu_L$. Without other sources of dissipation or random torques, we expect that
$\beta = 1/(2 k_B T_{\rm kin} I_{\rm max})$ where $I_{\rm max}$ is the largest eigenvalue of the moment of inertia tensor of
the particular grain being modelled and $T_{\rm kin}$ is the kinetic temperature of the ambient gas.
It is straightforward to generalize Eq. (106) to the intermediate case. It was however later
noted that a number of mechanisms exist for which $\langle \boldsymbol{\Gamma} \rangle \neq 0$, which can give rise to what is
known as “suprathermal rotation” [193], that is, rotation at an angular velocity larger than
that implied by any temperature associated with the other degrees of freedom interacting with the dust grain.
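
The stationary ensemble of Eq. (104) can be explored numerically. The sketch below is a plain Euler-Maruyama integration in arbitrary units; the step size, run length, seed and function name are illustrative choices of ours. In the high temperature limit ($\bar{L} = 0$) the Gaussian form of Eq. (106) implies $\langle L^2 \rangle = 3/(2\beta) = 3\mu_L/(2\alpha)$, which the simulation should approximately reproduce.

```python
import math
import random


def mean_square_L(alpha, mu_L, L_bar=0.0, dt=0.01, steps=200_000, seed=1):
    """Euler-Maruyama integration of Eq. (104); returns the time average of |L|^2.

    Each step adds the drift -alpha (L - L_bar L/|L|) dt and a Gaussian
    impulse of variance mu_L * dt per component.
    """
    rng = random.Random(seed)
    L = [0.0, 0.0, 0.0]
    kick = math.sqrt(mu_L * dt)
    burn_in = steps // 10          # discard the initial transient
    total = 0.0
    for step in range(steps):
        norm = math.sqrt(L[0]**2 + L[1]**2 + L[2]**2)
        # L - L_bar L/|L| = (1 - L_bar/|L|) L; the direction is undefined
        # at |L| = 0, where we simply drop the L_bar term.
        scale = (1.0 - L_bar / norm) if norm > 0.0 else 1.0
        for i in range(3):
            L[i] += -alpha * scale * L[i] * dt + kick * rng.gauss(0.0, 1.0)
        if step >= burn_in:
            total += L[0]**2 + L[1]**2 + L[2]**2
    return total / (steps - burn_in)


if __name__ == "__main__":
    alpha, mu_L = 1.0, 1.0
    print("simulated <L^2>:", mean_square_L(alpha, mu_L))
    print("expected  <L^2>:", 3.0 * mu_L / (2.0 * alpha))
```

Setting `L_bar` much larger than $(\mu_L/\alpha)^{1/2}$ instead gives the low-temperature regime, in which $|\mathbf{L}|$ stays close to $\bar{L}$ and only the direction wanders.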


19.4 Suprathermal rotation 


Figure 24: Coherent torques from grain radiation pressure.

To see how such suprathermal rotation might come about, consider the caricature of a dust
grain as sketched in Fig. 24. Let us assume that the dust grain emits radiation isotropically
in the infrared, but that in the visible and UV bands radiation is absorbed or reflected by the
surfaces indicated in the figure. Let us further assume an isotropic visible or UV illumination.
The blackened (perfectly absorbing) faces suffer a radiation pressure equal to half that of 
the mirrored surfaces, giving a coherent torque in the direction indicated in the figure. The 
presence of two distinct temperatures, reminiscent of a thermal engine, is crucial to avoid 
thermal equilibrium for the rotational degree of freedom. As in the thermodynamics of an 
engine, there is a heat flow from a hotter temperature to a colder temperature with some of 
the energy being siphoned off in the form of work exerted on the three rotational degrees of 
freedom of the dust grain. Other possibilities for coherent torques result from photoelectron 
emission at privileged sites and the catalysis of molecular hydrogen on favored sites on the 
grain surface. In the former case, the incident UV photon is almost pure energy due to 
its masslessness, but the emitted electron carries substantial momentum for which there is 
a recoil. Molecular hydrogen forms almost exclusively through catalysis on a grain surface. 
Unlike in ordinary chemistry where truly two-body collisions are an exception and activation 
barriers prevent reactions from establishing equilibrium, for molecular hydrogen formation in 
the cold interstellar medium, the bottleneck is the lack of three-body collisions and the lack 
of an efficient mechanism for carrying away the excess energy, so that the hydrogen atoms 
combine rather than simply bouncing off each other. In principle, the energy can be lost 
radiatively, but the rate for this is too slow. Given that 4.2 eV is liberated, the recoil at 
selected sites can produce a substantial coherent torque on the grain. 
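
The statement that a photoelectron carries far more momentum than the photon that ejected it is easy to quantify. In the sketch below, the photon energy of 10 eV and the electron kinetic energy of 5 eV are assumed round numbers chosen only for illustration, and the electron is treated nonrelativistically.

```python
import math

M_E_C2_EV = 510_998.95  # electron rest energy in eV


def electron_to_photon_momentum(photon_energy_ev, electron_kinetic_ev):
    """Ratio p_electron / p_photon for a nonrelativistic photoelectron."""
    p_photon_c = photon_energy_ev                                   # p c = E for a photon
    p_electron_c = math.sqrt(2.0 * M_E_C2_EV * electron_kinetic_ev)  # p c = sqrt(2 m c^2 E_kin)
    return p_electron_c / p_photon_c


if __name__ == "__main__":
    # Hypothetical values: 10 eV photon, 5 eV photoelectron.
    print(electron_to_photon_momentum(10.0, 5.0))
```

For these assumed energies the electron momentum exceeds the photon momentum by a factor of a few hundred, which is why the recoil from photoelectron emission, rather than the photon absorption itself, dominates the torque.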

19.5 Dust grain dynamics and the galactic magnetic field 

Much of the dust grain rotational dynamics does not single out any preferred direction. To 
explain the polarization of starlight and of the thermal dust emission at microwave frequencies, 
some mechanism is needed that causes the dust grains to align themselves, at least locally, along
a common direction such as that of the magnetic field.


19.5.1 Origin of a magnetic moment along L 

Two effects generate a magnetic moment along the axis of L: 

(i) Inhomogeneous electric charge distribution in the dust grain. The second moment 
of the electric charge density about the principal axis of the largest eigenvalue of the moment 
of inertia tensor (denoted z) induces a magnetic dipole moment as the grain rotates about 
this axis, according to 


$$\boldsymbol{\mu} = \hat{\mathbf{z}}\,\Omega_z \int d^3r\,\rho_e(\mathbf{r})\,(x^2 + y^2), \qquad (107)$$

where $\rho_e(\mathbf{r})$ is the electrostatic charge density of the dust grain. In general, and particularly
when the grain structure is amorphous, the net charge density is nonvanishing, so the integral 
above does not vanish. This effect is known as the Rowland effect. 

(ii) The Barnett effect. Minimizing the free energy tends to align the free spins of a 
rotating body with the direction of rotation. In nonferromagnetic substances this effect is 
usually negligible, but rotating dust grains present an extreme environment because of their 
minute size and isolation from their environment. Interstellar dust grains are among the 
fastest rotating macroscopic objects known. 

To understand the origin of the Barnett effect, consider an isolated rotating body, whose 
conserved total angular momentum may be decomposed into two parts 

$$\mathbf{L}_{\rm tot} = \mathbf{L}_{\rm bulk} + \mathbf{S}_{\rm free}. \qquad (108)$$

We consider the energetics of the exchange of angular momentum between the bulk and spin
degrees of freedom. We assume the spins to be free, neglecting any interactions between them.
In this case, the Hamiltonian for the system is

$$H = \frac{\mathbf{L}_{\rm bulk}^2}{2I} = \frac{(\mathbf{L}_{\rm tot} - \mathbf{S})^2}{2I}, \qquad (109)$$

and retaining only the term linear in the spin, we obtain

$$H = -\boldsymbol{\Omega}\cdot\mathbf{S}, \qquad (110)$$

where $\boldsymbol{\Omega}(t) = I^{-1}(t)\mathbf{L}$ is the angular rotation velocity. The spins want to align themselves
with the rotation to lower the rotational energy. So far we have not included any
terms opposing complete alignment, so at T = 0 complete alignment would result with
$\mathbf{S} = S_{\rm available}\,\boldsymbol{\Omega}/|\boldsymbol{\Omega}|$. But at finite temperature entropic considerations oppose such alignment, and
it is the free energy $F = H - T\mathcal{S}(\mathbf{S})$ or

$$F(\mathbf{S}; \boldsymbol{\Omega}) = -\boldsymbol{\Omega}\cdot\mathbf{S} + \alpha T\,\mathbf{S}^2 \qquad (111)$$

that is minimized. Far from saturation, when the degree of alignment is small, this quadratic
term provides a good approximation to the entropy of the spin system.
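
As a short worked step, the minimization of the quadratic free energy of Eq. (111) can be carried out explicitly, treating $\alpha$ and $T$ as constants. The symbol $\mathbf{S}_{\rm eq}$ for the equilibrium spin is our own notation.

```latex
\begin{align}
\frac{\partial F}{\partial \mathbf{S}}
  = -\boldsymbol{\Omega} + 2\alpha T\,\mathbf{S} = 0
\quad\Longrightarrow\quad
\mathbf{S}_{\rm eq} = \frac{\boldsymbol{\Omega}}{2\alpha T},
\qquad
F(\mathbf{S}_{\rm eq};\boldsymbol{\Omega}) = -\frac{\Omega^2}{4\alpha T}.
\end{align}
```

The equilibrium spin is thus parallel to $\boldsymbol{\Omega}$, grows linearly with the rotation rate, and is suppressed as $1/T$, as long as it remains far from saturation.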






19.6 Magnetic precession 

Having established that a spinning dust grain has a magnetic moment parallel to its angular 
momentum vector, which is aligned with the principal axis of the largest eigenvalue of the 
moment of inertia tensor, we now consider the effect of the torque resulting from the interaction 
with the ambient magnetic field. The interaction Hamiltonian is 

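$$H_{\rm int} = -\boldsymbol{\mu}\cdot\mathbf{B}, \qquad (112)$$

from which it follows that

$$\dot{\mathbf{L}} = \boldsymbol{\mu}\times\mathbf{B} = \gamma_m\,\mathbf{L}\times\mathbf{B}. \qquad (113)$$

In other words, the angular momentum vector precesses at a frequency $\omega_{\rm prec} = \gamma_m|\mathbf{B}|$ around
the magnetic field. Under this precession, the angle between $\mathbf{L}$ and $\mathbf{B}$ remains constant,
meaning that this effect neither aligns nor disaligns the grain with respect to the local magnetic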
field direction. 
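
That pure precession cannot change the alignment can be checked by integrating Eq. (113) directly. The following sketch uses a classical fourth-order Runge-Kutta step in arbitrary units; the initial vector, field and value of $\gamma_m$ are hypothetical.

```python
import math


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def angle_between(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return math.acos(max(-1.0, min(1.0, dot / (na * nb))))


def precess(L0, B, gamma_m, dt, steps):
    """Integrate dL/dt = gamma_m * L x B with the classical RK4 scheme."""
    def rhs(L):
        return tuple(gamma_m * c for c in cross(L, B))

    L = tuple(L0)
    for _ in range(steps):
        k1 = rhs(L)
        k2 = rhs(tuple(L[i] + 0.5 * dt * k1[i] for i in range(3)))
        k3 = rhs(tuple(L[i] + 0.5 * dt * k2[i] for i in range(3)))
        k4 = rhs(tuple(L[i] + dt * k3[i] for i in range(3)))
        L = tuple(L[i] + dt * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6.0
                  for i in range(3))
    return L


if __name__ == "__main__":
    L0, B = (1.0, 0.0, 1.0), (0.0, 0.0, 2.0)
    L1 = precess(L0, B, gamma_m=1.5, dt=1.0e-3, steps=10_000)
    print("angle before:", angle_between(L0, B))
    print("angle after: ", angle_between(L1, B))
```

The vector sweeps around $\mathbf{B}$ at the angular frequency $\gamma_m|\mathbf{B}|$ while the angle to the field stays fixed to numerical accuracy.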

We estimate the time scale of this precession. Let $\boldsymbol{\mathcal{S}}$ denote the spin polarization density
(angular momentum per unit volume), which under saturation (i.e., all free spins aligned)
would have a magnitude $\mathcal{S}_{\rm sat}$, whose order of magnitude can be estimated by assuming approximately
Avogadro’s number $N_A$ of spins having a magnitude $\hbar$ in a cubic centimeter volume,
giving $\mathcal{S}_{\rm sat} \sim O(1)\,(4\times 10^{-4})\ {\rm erg\cdot s\cdot cm^{-3}}$. We obtain the following free energy density (per
unit volume):

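$$F_v(\boldsymbol{\mathcal{S}}; T) = \frac{1}{2}(k_B T)\,\mathcal{N}_{\rm spin}\left(\frac{\mathcal{S}}{\mathcal{S}_{\rm sat}}\right)^2 - \boldsymbol{\Omega}\cdot\boldsymbol{\mathcal{S}}. \qquad (114)$$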


Here for a paramagnetic substance the order of magnitude for $\mathcal{N}_{\rm spin}$ might be around $10^{24}\ {\rm cm^{-3}}$,
but for a ferromagnetic substance could be much smaller, corresponding to the density of
domains. Minimizing the free energy with respect to $\boldsymbol{\mathcal{S}}$ gives a spin polarization density
proportional to $\boldsymbol{\Omega}$.
We express the constant of proportionality as if it were a moment of inertia because it has
the same units.

19.6.1 Barnett dissipation

The coupling in the Barnett effect between the bulk and spin angular momenta does not act 
instantaneously. Rather as the instantaneous angular velocity $\boldsymbol{\Omega}(t)$ changes, there is a lag in
the response of $\mathbf{S}(t)$. In his landmark 1979 paper [193], Purcell pointed out that this lag would
help align the body coordinates so that the axis of maximum moment of inertia is parallel 
to the angular momentum vector. Purcell coined the name Barnett dissipation for this effect, 
whose magnitude is greater than the mechanical dissipation mentioned above. 

Two types of free spins participate in the Barnett effect: the electronic spins and the nuclear
spins. The electronic spins are much more relevant for establishing a magnetic moment along L
as discussed above in Sec. 19.5.1 because electronic magnetic moments are larger than nuclear
magnetic moments by a factor of about $(m_p/m_e)$. But for dissipation, this strong coupling is
a liability. The nuclear spins contribute predominantly to Barnett dissipation because of their 
longer lag time. 

Mathematically, the coupled equations 


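$$\dot{\mathbf{S}}(t) = -\gamma\,\left(\mathbf{S}(t) - \alpha\,\boldsymbol{\Omega}(t)\right), \qquad \dot{\mathbf{L}}(t) = +\gamma\,\left(\mathbf{S}(t) - \alpha\,\boldsymbol{\Omega}(t)\right), \qquad (117)$$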


describe this effect. We may integrate out S (t) to obtain the following equation in inertial 
coordinates for the torque on L resulting from the exchange of angular momentum between 
the bulk and spin degrees of freedom 

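$$\dot{\mathbf{L}}(t) = -\alpha\gamma\,\boldsymbol{\Omega}(t) + \alpha\gamma^2 \int_0^\infty d\tau\,\exp[-\gamma\tau]\,\boldsymbol{\Omega}(t - \tau). \qquad (118)$$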
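
For completeness, the intermediate step leading from Eq. (117) to Eq. (118) can be written out. It assumes only that the spin system has had time to forget its initial condition, so that the homogeneous solution has decayed away.

```latex
\begin{align}
\mathbf{S}(t) &= \alpha\gamma \int_0^\infty d\tau\, e^{-\gamma\tau}\,
                 \boldsymbol{\Omega}(t-\tau), \\
\dot{\mathbf{L}}(t) &= \gamma\,\mathbf{S}(t) - \alpha\gamma\,\boldsymbol{\Omega}(t)
  = -\alpha\gamma\,\boldsymbol{\Omega}(t)
    + \alpha\gamma^2 \int_0^\infty d\tau\, e^{-\gamma\tau}\,
      \boldsymbol{\Omega}(t-\tau).
\end{align}
```

As a check, for a constant $\boldsymbol{\Omega}$ the integral equals $\boldsymbol{\Omega}/\gamma$ and the two terms cancel, so a grain in steady rotation experiences no torque from the spin system.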

To calculate how fast this exchange of angular momentum aligns the axis of maximum 
moment of inertia with the angular momentum vector, we calculate the averaged rotational 
energy loss rate of the bulk degrees of freedom retaining only secular contributions. This 
calculation is simpler than keeping track of torques. The degree of alignment of $\boldsymbol{\Omega}(t)$ with
the axis of maximum moment of inertia is a function of the rotational energy in the bulk 
rotational degrees of freedom. The instantaneous power transferred from the bulk rotation to 
the spin degrees of freedom is given by 


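$$P_{\rm loss}(t) = \boldsymbol{\Omega}(t)\cdot\dot{\mathbf{L}}(t) = -\boldsymbol{\Omega}(t)\cdot\dot{\mathbf{S}}(t) = -\gamma I_s\,\boldsymbol{\Omega}(t)\cdot\left[\boldsymbol{\Omega}(t) - \gamma\int_0^\infty dt'\,\exp[-\gamma t']\,\boldsymbol{\Omega}(t - t')\right] \qquad (119)$$

and to obtain the averaged or secular contribution, we decompose $\boldsymbol{\Omega}(t)$ into harmonic components
of amplitude $\boldsymbol{\Omega}_a$ and frequency $\omega_a$, so that

$$\left\langle \boldsymbol{\Omega}(t)\cdot\left[\boldsymbol{\Omega}(t) - \gamma\int_0^\infty dt'\,\exp[-\gamma t']\,\boldsymbol{\Omega}(t - t')\right]\right\rangle = \sum_a |\boldsymbol{\Omega}_a|^2\,\frac{\omega_a^2}{\gamma^2 + \omega_a^2}. \qquad (120)$$

It follows that

$$\langle P_{\rm loss}\rangle = -\gamma I_s \sum_a |\boldsymbol{\Omega}_a|^2\,\frac{\omega_a^2}{\gamma^2 + \omega_a^2}. \qquad (121)$$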


We see that the stationary component (with $\omega_a = 0$) does not contribute to the loss but the
precessing components dissipate. 
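
The frequency dependence of the secular loss can be verified numerically for a single real harmonic component. The sketch below integrates the lag equation for a scalar stand-in `s` (the lagged copy of the angular velocity, i.e. the spin divided by its proportionality constant) and averages the bracket of Eq. (119) over whole periods. The driving form $A\cos(\omega t)$, the integration parameters and the function names are our own choices; for a real component of peak amplitude $A$ the time average is $(A^2/2)\,\omega^2/(\gamma^2 + \omega^2)$, with $A^2/2$ playing the role of the mean-square amplitude.

```python
import math


def predicted_mean(gamma, omega, amplitude=1.0):
    """Analytic time average of Omega * (Omega - lagged Omega)."""
    return 0.5 * amplitude**2 * omega**2 / (gamma**2 + omega**2)


def mean_lag_product(gamma, omega, amplitude=1.0, n_per=2000, n_avg=20):
    """Numerical time average of Omega(t) * (Omega(t) - s(t)).

    s obeys ds/dt = -gamma (s - Omega) with Omega(t) = amplitude cos(omega t);
    omega must be positive. The average is taken over n_avg whole periods
    after a transient of at least 30 decay times 1/gamma.
    """
    period = 2.0 * math.pi / omega
    dt = period / n_per
    n_transient = math.ceil(30.0 / (gamma * period))  # whole periods

    def drive(t):
        return amplitude * math.cos(omega * t)

    def rhs(t, s):
        return -gamma * (s - drive(t))

    s, total = 0.0, 0.0
    for k in range((n_transient + n_avg) * n_per):
        t = k * dt
        if k >= n_transient * n_per:
            total += drive(t) * (drive(t) - s)
        # classical RK4 step for the lag equation
        k1 = rhs(t, s)
        k2 = rhs(t + 0.5 * dt, s + 0.5 * dt * k1)
        k3 = rhs(t + 0.5 * dt, s + 0.5 * dt * k2)
        k4 = rhs(t + dt, s + dt * k3)
        s += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return total / (n_avg * n_per)


if __name__ == "__main__":
    print(mean_lag_product(1.0, 2.0), predicted_mean(1.0, 2.0))
```

The factor $\omega^2/(\gamma^2 + \omega^2)$ vanishes for a stationary component and saturates at unity when the component varies much faster than the spins can follow.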

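We obtain an order of magnitude for the alignment rate by assuming that $(\delta I)/I \approx O(1)$
and that except for the stationary component $\omega_a \approx \omega_{\rm max} = L/I_{\rm max}$, so that the excess energy
is approximately

$$E_{\rm excess} \approx O(1)\,I_{\rm max}\,\omega_{\rm max}^2\,\sin^2\theta, \qquad (122)$$

where $\theta$ is the disalignment angle, and the rate of dissipation is approximately

$$P_{\rm loss} \approx O(1)\,\gamma I_s\,\omega_{\rm max}^2\,\frac{\omega_{\rm max}^2}{\gamma^2 + \omega_{\rm max}^2}\,\sin^2\theta, \qquad (123)$$

so that

$$\gamma_{\rm align} \approx \gamma\,\frac{I_s}{I_{\rm max}}\,\frac{\omega_{\rm max}^2}{\gamma^2 + \omega_{\rm max}^2}. \qquad (124)$$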


The action of the imperfect coupling of the spins to the bulk angular momentum and bulk 
rotational velocity is somewhat analogous to that of a harmonic balancer on a rotating shaft 
(such as a crankshaft), a common device in mechanical engineering. Here however the weakly 
and dissipatively coupled flywheel is replaced by the internal spin degrees of freedom. 


19.7 Davis-Greenstein magnetic dissipation 

If a grain rotates with an angular velocity along the direction of the magnetic field, in the frame 
co-rotating with the grain, the magnetic field is time independent. However the component of
the angular velocity in the plane normal to the magnetic field gives rise to an oscillating
magnetic field of angular frequency $\Omega_\perp$. In general, the magnetic susceptibility at nonzero
frequency has a nonzero imaginary part $\chi_{m,i}$, where

$$\chi_m(\omega) = \chi_{m,r}(\omega) + i\,\chi_{m,i}(\omega). \qquad (125)$$

The lag in the response of the magnetization to the applied driving field gives rise to dissipation
accompanied by a torque that tends to align (or antialign) the spin with the magnetic field.
This effect is known as the Davis-Greenstein mechanism [40].

The alignment tendency of the Davis-Greenstein mechanism may be described by the pair 
of equations 

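$$\dot{\mathbf{L}}_\perp = -\frac{1}{\tau_{DG}}\,\mathbf{L}_\perp, \qquad \dot{\mathbf{L}}_\parallel = 0, \qquad (126)$$

where the decay time $\tau_{DG}$ indicates the time scale of the Davis-Greenstein relaxation process.
In deriving this simple form, we have assumed that $\tau_{\rm prec} \ll \tau_{DG}$, so that we may average over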
many precessions. We also assume that the alignment with the axis of maximum moment of 
inertia likewise is very fast. If these assumptions do not apply, an averaged equation of the 
simple form above does not hold, and these effects need to be analyzed simultaneously. 
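
Equation (126) has a simple closed-form consequence for the angle between the angular momentum and the field, sketched below. The function name and the sample angles are illustrative only.

```python
import math


def dg_angle(theta0, t, tau_dg):
    """Angle between L and B after time t under Eq. (126).

    The perpendicular component decays as exp(-t / tau_dg) while the
    parallel component is constant, so tan(theta) shrinks exponentially.
    """
    l_par = math.cos(theta0)
    l_perp = math.sin(theta0) * math.exp(-t / tau_dg)
    return math.atan2(l_perp, l_par)


if __name__ == "__main__":
    theta0 = math.radians(60.0)
    for n in (0, 1, 2, 5):
        print(n, math.degrees(dg_angle(theta0, n * 1.0, 1.0)))
```

An initial angle below 90 degrees relaxes toward 0 and one above 90 degrees toward 180 degrees, which is the alignment or antialignment mentioned above. Note that in this idealization $|\mathbf{L}|$ decreases as the perpendicular component is dissipated.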

A key issue in evaluating the viability of the Davis-Greenstein mechanism is the frequency 
of reversals of the direction of the coherent torque, termed “cross-over” events in the literature. 
If we consider for example coherent torques from the formation of molecular hydrogen at a 
small number of catalysis sites on the grain surface, the orientation of this torque depends 
on the structure of the outermost surface atomic monolayer, and its structure is likely to 
change as new layers are deposited or as old layers are eroded away. Consequently, the long 
term coherence time of the “coherent” torque is uncertain. Under the assumption that the 
alignment of the body coordinates with the angular momentum is fast, it is the evolution of the 
coherent torque projected onto the axis of maximum moment of inertia that matters. While 
the grain rotates suprathermally, its direction is relatively stable against random perturbing 
torques, but when its rotation velocity is in the process of changing sign and of the same order
as the would-be “thermal” rotation velocity, the grain is particularly susceptible to changing 
orientation as a result of random torques as discussed in detail in Refs. |213j and jllfij . 

In deciding whether $\tau_{DG}$ is too long for producing a significant degree of alignment, we
must consider competing effects that tend to disalign the grains, which modify the above
equation to become a stochastic differential equation of the form

$$\dot{\mathbf{L}} = -\frac{1}{\tau_{DG}}\left(\mathbf{L} - \mathbf{e}_B(\mathbf{e}_B\cdot\mathbf{L})\right) - \frac{1}{\tau_{\rm rel}}\left(\mathbf{L} - \bar{L}\,\mathbf{e}_L\right) + \frac{\bar{L}}{\tau_{\rm rand}^{1/2}}\,\mathbf{n}(t), \qquad (127)$$

where $\mathbf{e}_B = \mathbf{B}/|\mathbf{B}|$, $\mathbf{e}_L = \mathbf{L}/|\mathbf{L}|$, and $\mathbf{n}(t)$ is a normalized white noise source whose expectation
value is given by

$$\langle n_i(t)\,n_j(t')\rangle = \delta_{ij}\,\delta(t - t'). \qquad (128)$$

If $\tau_{\rm rel} \ll \tau_{\rm rand}$, in the absence of a magnetic field, $\mathbf{L}$ undergoes a random walk on the surface
of a sphere of radius $\bar{L}$. If $\tau_{DG} \ll \tau_{\rm rand}$ the magnetic dissipation is strong enough to bring
about almost complete alignment. However if, on the other hand, $\tau_{DG} \gg \tau_{\rm rand}$, the magnetic
dissipation leads to a negligible amount of alignment.


19.8 Alignment along B without Davis-Greenstein dissipation 

Because of the magnetic precession discussed in Sec. 19.6, an average alignment with the
magnetic field direction (or more precisely with a direction very close to the magnetic field
direction) can be obtained if there is a tendency toward alignment with some other direction
characterized by a time scale $\tau_{\rm align}$ where $\tau_{\rm align} \gg \tau_{\rm prec}$. The effect of precession about an axis $\boldsymbol{\Omega}$
(presumably along $\pm\mathbf{B}$) in competition with alignment about another direction $\mathbf{n}$ is described
by the ordinary differential equation

$$\dot{\mathbf{L}} = \boldsymbol{\Omega}\times\mathbf{L} - \gamma\,(\mathbf{L} - \mathbf{n}\bar{L}). \qquad (129)$$

This equation has two time scales: $\Omega^{-1}$ and $\gamma^{-1}$. We are primarily interested in fast precession
(i.e., $\Omega \gg \gamma$). Equation (129) is a linear equation whose solution decomposes into a stationary
part and transient spiralling in to approach the stationary attractor. We solve for the stationary
part by setting $\dot{\mathbf{L}} = 0$.

In the fast precession regime of most interest to us, $\mathbf{L} = \bar{L}\,\boldsymbol{\Omega}\,(\boldsymbol{\Omega}\cdot\mathbf{n})/\Omega^2$, with corrections
suppressed by a factor $(\gamma/\Omega)$. The precession has no alignment tendency of its own, but if the
precession period is shorter than the time scale of the alternative alignment mechanism, the
precession acts to suppress alignment in the plane normal to $\boldsymbol{\Omega}$.
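
The stationary point of Eq. (129) can be written in closed form and checked numerically. The expression used below, $\mathbf{L}_* = \bar{L}\,[\gamma^2\mathbf{n} + \gamma\,\boldsymbol{\Omega}\times\mathbf{n} + \boldsymbol{\Omega}(\boldsymbol{\Omega}\cdot\mathbf{n})]/(\gamma^2 + \Omega^2)$, is our own solution of the linear system obtained by setting the time derivative to zero; it is offered as a sketch rather than as the form given in the original derivation, and the numerical values are arbitrary.

```python
import math


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def stationary_L(Omega, n, gamma, L_bar):
    """Closed-form solution of 0 = Omega x L - gamma (L - L_bar n)."""
    denom = gamma**2 + dot(Omega, Omega)
    Oxn = cross(Omega, n)
    On = dot(Omega, n)
    return tuple(L_bar * (gamma**2 * n[i] + gamma * Oxn[i] + On * Omega[i]) / denom
                 for i in range(3))


def residual(Omega, n, gamma, L_bar, L):
    """Right-hand side of Eq. (129) evaluated at L (zero at the fixed point)."""
    OxL = cross(Omega, L)
    return tuple(OxL[i] - gamma * (L[i] - L_bar * n[i]) for i in range(3))


if __name__ == "__main__":
    Omega = (0.0, 0.0, 1000.0)                  # fast precession about z
    n = (math.sin(0.7), 0.0, math.cos(0.7))     # hypothetical alignment direction
    L = stationary_L(Omega, n, gamma=1.0, L_bar=2.0)
    print("stationary L:", L)
    print("residual:    ", residual(Omega, n, 1.0, 2.0, L))
```

For $\Omega \gg \gamma$ the components normal to $\boldsymbol{\Omega}$ are suppressed by $\gamma/\Omega$ or more, and the result reduces to the fast-precession expression $\bar{L}\,\boldsymbol{\Omega}(\boldsymbol{\Omega}\cdot\mathbf{n})/\Omega^2$.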

If it turns out that radiative torques combined with the fast precession described above 
is the solution to the alignment problem, a number of interesting observational consequences 
follow. Blind data analysis might suggest that one should be able to predict polarization 
from the thermal dust emission intensity combined with a map of the galactic magnetic field,
perhaps deduced from rotation measures and synchrotron emission. However, if the above
mechanism is correct, this procedure will not work, even with a three-dimensional model of
the unpolarized dust emission and galactic magnetic field. In the above equations, provided
that the galactic magnetic field is strong enough, its direction serves to define the direction of
the grain alignment but its precise magnitude has little relevance. The above mechanism can
also accommodate a large amount of small-scale power in the polarized dust signal, even if
both the galactic magnetic field and the dust density are smooth. One might expect the field
characterizing the radiative alignment tendency $\bar{L}(\mathbf{x})\,\mathbf{n}(\mathbf{x})$ to include an appreciable component
varying on small scales.

19.9 Radiative torques 

Electromagnetic radiation scattered or absorbed by the dust grain in general exerts a torque
on the dust grain, as already modeled in Sec. 19.4 for isotropic illumination, which is simpler
because the torque direction is constant in the co-rotating body coordinates. Here we
generalize to anisotropic illumination.

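Let $\boldsymbol{\Gamma}(\nu, \hat{\mathbf{n}})$ expressed in body coordinates be the torque exerted on a dust grain by
monochromatic electromagnetic radiation of frequency $\nu$ and unit spectral radiance propagating
in the $\hat{\mathbf{n}}$ direction. For an illumination described by the spectral radiance $I(\nu, \hat{\mathbf{n}})$, the
torque exerted on the dust grain in inertial coordinates is given by

$$\boldsymbol{\Gamma}_{\rm rad}(t) = \int d\nu \int_{S^2} d\hat{\mathbf{n}}\;\boldsymbol{\Gamma}(\nu, \mathcal{R}(t)\hat{\mathbf{n}})\,I(\nu, \hat{\mathbf{n}}). \qquad (132)$$

The complication here arises from the need to include both inertial and co-rotating body
coordinates, related by the transformation $\mathbf{x}_{\rm iner} = \mathcal{R}(t)\,\mathbf{x}_{\rm body}$.

Assuming that the time scale for radiative torques is long compared to the moment of
inertia alignment time scale $\tau_{I-L}$, we may take $\boldsymbol{\Omega}$ to be aligned with $\mathbf{L}$ and introduce an averaged
kernel $\bar{\boldsymbol{\Gamma}}(\nu, \hat{\mathbf{n}}_I; \hat{\mathbf{n}}_L)$, defined as

$$\bar{\boldsymbol{\Gamma}}(\nu, \hat{\mathbf{n}}_I; \hat{\mathbf{n}}_L) = \frac{1}{2\pi}\int_0^{2\pi} d\chi\;\boldsymbol{\Gamma}(\nu, R(\chi, \hat{\mathbf{n}}_L)\,\hat{\mathbf{n}}_I), \qquad (133)$$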

where $R(\chi, \hat{\mathbf{n}}_L)$ represents a rotation by $\chi$ about the $\hat{\mathbf{n}}_L$ axis. Apart from a few general observations,
little can be said about the form and properties of the radiative torque kernel, which
differ from grain to grain. At very long wavelengths, that is, long compared to the size of the
dust grain, it is a good approximation to treat the grain as a dipole (as one does for Rayleigh
scattering). In the dipole approximation there are no torques. Therefore, $(a/\lambda)$ serves as an
expansion parameter for which the torque approaches zero as this parameter tends toward
zero. Likewise a spherical grain gives a vanishing torque at any frequency as the result of 
symmetry arguments. 

Convolving the incident radiation intensity $I(\nu, \hat{\mathbf{n}}_I)$ with this kernel, we obtain the angle-averaged
torque $\boldsymbol{\Gamma}_{\rm tot,rad}(\hat{\mathbf{n}}_L)$. If there were no magnetic field, we could incorporate the torque
above into the previously derived stochastic differential equation for the evolution of $\mathbf{L}$ [Eq. (104)]
to obtain

$$\dot{\mathbf{L}}(t) = -\beta\,\mathbf{L}(t) + \beta\bar{L}\,\frac{\mathbf{L}(t)}{|\mathbf{L}(t)|} + \boldsymbol{\Gamma}_{\rm tot,rad}\!\left(\frac{\mathbf{L}(t)}{|\mathbf{L}(t)|}\right) + \mathbf{n}_{\rm ran}(t), \qquad (134)$$


where a coherent torque in body coordinates has been included as well. The above equation 
describes a random walk $\mathbf{L}(t)$ in three-dimensional angular momentum space. Owing to the
stochastic random torque noise term, the rotational degree of freedom loses memory of its
initial state after some characteristic decay time $\tau_{\rm ran}$, after which its state can be described
by a probability distribution $p(\mathbf{L})$ calculable based on the above equation. The stochastic
equation above has the general form

$$\dot{\mathbf{L}} = \mathbf{v}(\mathbf{L}) + \mathbf{n}(t), \qquad (135)$$

for which the associated Fokker-Planck equation describing the evolution of the probability
density is

$$\frac{\partial p(\mathbf{L})}{\partial t} = \nabla^2 p(\mathbf{L}) - \nabla\cdot\left(p(\mathbf{L})\,\mathbf{v}(\mathbf{L})\right). \qquad (136)$$

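With the magnetic field included, we add the magnetic precession term to obtain

$$\dot{\mathbf{L}}(t) = -\beta\,\mathbf{L}(t) + \beta\bar{L}\,\frac{\mathbf{L}(t)}{|\mathbf{L}(t)|} + \boldsymbol{\Gamma}_{\rm tot,rad}\!\left(\frac{\mathbf{L}(t)}{|\mathbf{L}(t)|}\right) + \gamma_{\rm mag}\,\mathbf{L}\times\mathbf{B} + \mathbf{n}_{\rm ran}(t), \qquad (137)$$

where the constant $\gamma_{\rm mag}$ relates the magnetic moment to the angular momentum according to
$\boldsymbol{\mu} = \gamma_{\rm mag}\mathbf{L}$.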

Figure 25: Selection of grain shapes for studying radiative torques. In Ref. [53], Draine
and Weingartner calculated the radiative torque on a dust grain as a function of frequency
$\nu$ and incident photon direction $\hat{\mathbf{n}}$ for the three grain shapes illustrated above. (Credit: B.
Draine and J. Weingartner)


When $\tau_{\rm ran}, \tau_{\rm rad} \gg \tau_{\rm prec}$, it is possible to reduce the dimension of the stochastic differential
equation to two dimensions, namely the angular velocity $\omega$ and the angle $\theta_{BL}$ between the
vectors $\mathbf{L}$ and $\mathbf{B}$, obtaining

$$\dot{\theta}_{BL} = F(\theta_{BL}, \omega) + n_\theta(t), \qquad \dot{\omega} = G(\theta_{BL}, \omega) + n_\omega(t). \qquad (138)$$

Unfortunately, very little can be said about the form of the functions $F(\theta_{BL}, \omega)$ and $G(\theta_{BL}, \omega)$,
which depend on the detailed grain shapes and other properties. Draine and Weingartner
[531 153 .1 1231] studied three possible irregular grain shapes (shown in Fig. 25), calculating 
numerically the radiative torque as a function of illumination direction and frequency. Then 
assuming a certain anisotropic illumination, they calculated the functions F and G in Eq. (138) 
and studied the solutions to this equation numerically establishing the orbits in the absence 
of noise. Their simulations conclude that radiative torques can provide a plausible alignment 
mechanism. 

The proposal that radiative torques resulting from anisotropic illumination align the
interstellar dust grains with the magnetic field offers a promising alternative to the Davis-
Greenstein magnetic dissipation mechanism whose characteristic time scale may be too long 
to provide the needed degree of alignment. However, its viability is hard to evaluate given 
our ignorance concerning the shapes of the dust grains. Without such shape information, the 
best case that can be put forward in favor of this scenario based on radiative torques and 
anisotropic illumination is to show that some candidate shapes exist that give the right orders 
of magnitude combined with the argument that the Davis-Greenstein scenario cannot account 
for the data. It is discouraging that it is not possible to formulate precise predictions for the 
radiative torque scenario. The origin of grain alignment seems to remain an open question. 

19.10 Small dust grains and anomalous microwave emission (AME)

The thermal emission of interstellar dust grains discussed above was based on the assumption
that the thermal state of a grain may be characterized by a mean temperature (around 20 K) and that
fluctuations about this temperature subsequent to the absorption of a UV photon constitute
a negligible correction. If a dust grain is sufficiently large, this is a good approximation, but for
small molecules an FUV (far ultraviolet) photon can raise the grain temperature momentarily
by an order of magnitude or more, allowing emission in bands where there would be almost
no emission for a grain at its average temperature. The bulk of the energy absorbed from the
UV photon is then rapidly re-emitted through IR fluorescence as the molecule cools
back down to its mean temperature. Hints of such emission requiring a temperature significantly
above the mean temperature were noted through the observation of aromatic infrared
emission features consistent with C-H bonds on the edge of an aromatic molecule, that is,
around 3.3 and 11.3 microns. Later, analysis of the IRAS dust maps showed that a
significant fraction of the UV photons absorbed through extinction were re-emitted in bands
of wavelength shorter than 60 $\mu$m, where under the assumption of a reasonable average temperature
and a modified blackbody law there should be no emission [190]. As a consequence, the
dust models were revised to extend the size distribution to include small grains thought to be 
PAHs as a result of the infrared emission features described above. PAHs (polycyclic aromatic 
hydrocarbons) are essentially small monolayer planar sheets of graphite with hydrogen frills 
on their edges. 
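
The contrast between large grains and PAH-sized molecules can be illustrated with a deliberately crude estimate of the temperature excursion caused by a single photon. The sketch assumes the classical heat capacity $3 N k_B$ for a grain of $N$ atoms, which overestimates the true heat capacity at low temperature and therefore underestimates the excursion; the photon energy of 10 eV and the atom counts are assumed values.

```python
K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def temperature_jump_k(photon_energy_ev, n_atoms):
    """Crude temperature rise E / C with classical heat capacity C = 3 N k_B."""
    return photon_energy_ev / (3.0 * n_atoms * K_B_EV)


if __name__ == "__main__":
    # Hypothetical cases: a 50-atom PAH and a large grain of ~1e9 atoms.
    print("50 atoms :", temperature_jump_k(10.0, 50), "K")
    print("1e9 atoms:", temperature_jump_k(10.0, 1e9), "K")
```

Even with this conservative heat capacity, a molecule of a few tens of atoms is heated by hundreds of kelvin, whereas the temperature of a large grain is essentially unaffected.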

A template for the small grain population can be constructed using maps of infrared 
emission at $\lambda < 60\ \mu$m where the thermal emission from large grains does not contribute. Since
the short wavelength emission from the dust grains is proportional to the local FUV flux
density, these templates can in principle be improved by dividing by the local FUV flux density
to obtain a better map of the projected small dust grain column density, which in turn can 
be correlated with the AME (anomalous microwave emission) at low frequencies. 

It was also pointed out that such a population of very small dust grains would be expected 
to rotate extremely rapidly, giving rise to electric dipole emission at its rotation frequency 
and possibly at higher harmonics, since $P_e \sim \mu^2\omega^4$ and $\omega^2 \sim (k T_{\rm rot})/m a^2$ in the absence
of suprathermal rotation [SOI ED E]- The possibility of magnetic dipole emission from small 
rotating grains has also been investigated [52].
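
The estimate $\omega^2 \sim k T_{\rm rot}/(m a^2)$ can be turned into a rotation frequency to see why very small grains are relevant at microwave frequencies. In the sketch below the grain is taken to be a sphere of radius 5 Angstrom and density 2 g/cm^3 with $T_{\rm rot} = 100$ K; all three values are assumptions made for illustration.

```python
import math

K_B = 1.380649e-16  # Boltzmann constant in erg/K (cgs units)


def rotation_frequency_hz(radius_cm, density_g_cm3, t_rot_k):
    """Rotation frequency nu = omega / (2 pi) with omega^2 ~ k T_rot / (m a^2)."""
    mass = (4.0 / 3.0) * math.pi * density_g_cm3 * radius_cm**3
    omega = math.sqrt(K_B * t_rot_k / (mass * radius_cm**2))
    return omega / (2.0 * math.pi)


if __name__ == "__main__":
    # Hypothetical very small grain (a = 5 Angstrom) and a large grain (a = 0.1 micron).
    print("small grain:", rotation_frequency_hz(5.0e-8, 2.0, 100.0), "Hz")
    print("large grain:", rotation_frequency_hz(1.0e-5, 2.0, 100.0), "Hz")
```

For the assumed small grain the frequency comes out near 10 GHz, in the range where the anomalous emission is observed, while a 0.1 micron grain rotates at only tens of kHz.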

Observationally, emission at low frequencies, where diffuse thermal dust emission should be 
minuscule, was found to be correlated with high frequency dust maps [42, 106, 120]. Such emission
would be consistent with dipole emission from small rapidly rotating dust grains, as described 
above. As an alternative explanation for such a correlation, it was proposed that regions of 
free-free emission could be correlated with regions of thermal dust emission. However, this 
low frequency excess was found not to be correlated with H$\alpha$ line emission, which serves as a
tracer of free-free emission. One possibility to weaken such a correlation would be to postulate
an extremely high temperature, so that $\Gamma_{\rm free\text{-}free}(T)/\Gamma_{{\rm H}\alpha}$ is large. However, it was found that
such a high temperature would require an implausible energy injection rate to prevent this 
phase from cooling. 

The spectrum of anomalous microwave emission, presumably from spinning dust, has been
measured by taking spectra in targeted regions having a large contribution from AME and
subtracting the expected synchrotron, free-free, thermal dust, and primordial CMB signals
[229, 86]. Figure 26 shows the spectrum thus obtained. The observed fall-off at low $\nu$ disfavors
the hypothesis of free-free emission correlated with thermal dust emission as the origin of the
anomalous emission, because in that case, the spectrum of the residual should rise as $\nu$ is
lowered.

Figure 26: Consistency of anomalous microwave emission with spinning dust. The
multi-frequency observations shown here were taken for a molecular cloud believed to have
strong anomalous dust emission. A linear combination of thermal dust and free-free emission
cannot explain the observed spectral shape. However, if an anomalous
dust component with the spectral shape predicted by Draine and Lazarian for electric dipole
emission from spinning dust is included (dotted curve), a reasonable fit may be obtained. For
more details, see Ref. [229]. (Credit: COSMOSOMAS Collaboration)

Most theoretical studies have assumed that the small grains do not become aligned with 
the galactic magnetic field, but alignment mechanisms for this grain population have not 
yet been investigated in detail. Observational limits on the anomalous microwave emission 
constrain the degree of polarization to less than a few per cent [198].


20 Compact Sources 

The above processes pertain primarily to diffuse galactic emission, which contaminates the 
primordial CMB signal most severely at low multipole number $\ell$. On smaller scales, however,
compact sources become more of a concern and eventually become the dominant contribution 
at all frequencies. We use the term “compact” to denote both unresolved point sources and 
partially resolved, localized sources. 

The angular power spectrum of point sources distributed according to a Poissonian
distribution would scale with multipole number as $\ell^0$. If, ignoring the structure of the acoustic
oscillations, one approximates the primordial CMB spectrum as having the shape $\ell^{-2}$, one
would conclude that compact sources rapidly take over at high $\ell$. Because of Silk damping
combined with the smoothing from the finite width of the last scattering surface, the fall-off of
the CMB at large $\ell$ is even more rapid. Because point sources are clustered rather than
distributed in a Poissonian manner, the point source power spectrum acquires a clustering contribution,
leading to an excess of power at small $\ell$ compared to an $\ell^0$ power law.
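
The flat ($\ell^0$) spectrum of Poisson-distributed sources can be checked numerically in a flat-sky toy model. The sketch below scatters unclustered sources of equal brightness over a square pixel grid and compares the average Fourier power at low and high wavenumber. The grid size, source density, and function name are arbitrary assumptions for illustration, and a small flat patch with periodic boundaries stands in for the curved sky.

```python
import numpy as np

def poisson_power_ratio(n_pix=256, mean_sources_per_pixel=0.05, seed=0):
    """Ratio of mean power at low wavenumber to mean power at high wavenumber."""
    rng = np.random.default_rng(seed)
    # Independent Poisson counts per pixel: an unclustered point source population.
    counts = rng.poisson(mean_sources_per_pixel, size=(n_pix, n_pix)).astype(float)
    delta = counts - counts.mean()

    # Two-dimensional power spectrum, normalized so that white noise gives the variance.
    power = np.abs(np.fft.fft2(delta)) ** 2 / n_pix**2

    # Integer wavenumber magnitude of each Fourier mode.
    k = np.fft.fftfreq(n_pix) * n_pix
    kx, ky = np.meshgrid(k, k, indexing="ij")
    k_mag = np.hypot(kx, ky)

    low = power[(k_mag > 0) & (k_mag < n_pix / 4)].mean()
    high = power[(k_mag >= n_pix / 4) & (k_mag <= n_pix / 2)].mean()
    return low / high

ratio = poisson_power_ratio()
print(f"low-k / high-k power ratio: {ratio:.3f}")
```

A ratio close to unity reflects the scale-independent shot-noise spectrum. A clustered source population would instead show excess power in the low-wavenumber bin.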

20.1 Radio galaxies 

Section 16 described the physical process of synchrotron emission, emphasizing its contribution
to the diffuse emission from our own galaxy. However, the same process is also at work in
galaxies other than our own. In particular, in the rare but highly luminous radio galaxies
at high redshift, extreme processes fueled by a central black hole give rise to intense emission
in many bands, including synchrotron emission in the radio and microwave bands. For our
purposes, the most salient feature of these sources is their inverted spectrum.
For a detailed survey of radio sources, see Ref. [?]. Since each source differs slightly in its
spectral properties, the strategy for dealing with these sources is to mask out the brightest 
sources and then model the residual arising from the remaining unmasked sources. 


20.2 Infrared galaxies 


In Sec. 18, we discussed at length the contamination from thermal emission from the interstellar
dust in our galaxy. Thermal dust emission, however, is not restricted to our galaxy.
All galaxies contribute some thermal dust emission. However, the dominant contribution to the
extragalactic thermal dust emission in the microwave bands of interest to us comes from galaxies
at high redshift in which star formation is occurring at a rapid rate. In fact, most of the energy
radiated by galaxies in which intense star formation is taking place is emitted
in the infrared, because the visible and UV light emitted by stars is absorbed by dust, which becomes heated
and in turn re-emits in the infrared.

Some of these infrared galaxies are resolved by current observations. Others remain unresolved
and contribute to the diffuse infrared background.
These infrared galaxies do not all emit with the same frequency spectrum, in part because they
are spread over a wide range of redshifts. For the early discovery papers on the CIB (cosmic
infrared background) from COBE, see Refs. [191, 182] and [63]. For more recent observations
from Planck, see Ref. [?].


21 Other Effects 

21.1 Patchy reionization 

In Sec. 7.5, we idealized the reionized hot electron gas extending from intermediate redshifts
to today as being isotropically distributed so that the free electron density depends only on
redshift. Such a reionization scenario is implausible but provides a good approximation for
the effect of reionization on the CMB power spectrum at low $\ell$. In a more realistic reionization
scenario, the universe becomes reionized in an inhomogeneous manner. As the first stars and
quasars form, their UV radiation creates around them spheres of almost completely ionized
hydrogen (HII) that grow and eventually coalesce. These spheres have sharp edges that act to transform
large-angle power from the primordial CMB into small-scale power. For some estimates of the
impact of patchy reionization on the CMB angular power spectrum, see Ref. [262].

21.2 Molecular lines 

The sources of diffuse galactic emission discussed so far all have a broad continuum emission 
spectrum. But there also exist molecular lines, mostly from rotational transitions of molecules, 
that emit in the microwave bands of interest here. The brightest of these are the CO lines 
[38], of which the lowest is situated at 115.3 GHz. Since CO is a linear molecule, there are
also transitions at approximately integer multiples of this frequency. Removing CO emission from the Planck
data has proven to be a challenge [?]. For more sensitive future experiments, it is likely
that other lines will also present concerns.
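
The ladder of CO lines follows from treating the molecule as a rigid rotor, for which the rotational energies are $E_J = h B J(J+1)$ and the transition $J \to J-1$ lies at $\nu_J = 2 B J$, that is, at $J$ times the frequency of the lowest line. The short sketch below tabulates the first few lines under this rigid-rotor assumption. The function name is hypothetical, and real line frequencies lie slightly below these values because of centrifugal distortion.

```python
CO_J1_0_GHZ = 115.3  # lowest CO rotational line, as quoted in the text

def rigid_rotor_lines_ghz(n_lines, fundamental_ghz=CO_J1_0_GHZ):
    """Frequencies of the J -> J-1 transitions for J = 1..n_lines (rigid rotor)."""
    return [j * fundamental_ghz for j in range(1, n_lines + 1)]

lines = rigid_rotor_lines_ghz(3)
for j, nu in enumerate(lines, start=1):
    print(f"J = {j} -> {j - 1}: {nu:.1f} GHz")
```

Whether a given line contaminates a map depends on whether it falls inside the bandpass of one of the instrument's frequency channels.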

21.3 Zodiacal emission 

Besides the planets and asteroids, our solar system is filled with large dust grains, known as 
zodiacal dust, which on the average are much larger than the interstellar dust grains described 
above. These grains manifest themselves by a variety of means. In the visible they scatter 
light from the sun, and this background must be subtracted when analyzing diffuse emission 
in the optical and near infrared bands. The zodiacal dust is concentrated around the ecliptic 
plane. It is a good but not perfect approximation to model the distribution of zodiacal dust 
assuming azimuthal symmetry. The temperature of the zodiacal dust is much above that of
the interstellar dust and roughly proportional to the inverse square root of the distance from the
sun. Subtracting zodiacal emission is challenging because its contribution to a given point
on the celestial sphere depends on the time of year when the observation is made, because 
observations at different times of the year look through different columns of dust, even when 
azimuthal symmetry is assumed. The ideal observing program for characterizing the zodiacal 
dust would cover the whole sky many times over the year in order to accumulate as many 
lines of sight to the same point on the celestial sphere as possible and to carry out a sort of 
tomography. For a discussion of zodiacal emission at long wavelengths as measured by Planck, 
see Ref. [168]. For previous work on zodiacal emission in the far infrared, see Ref. [65].

22 Extracting the Primordial CMB Anisotropies 

Most of the contaminants described above, with the exception of gravitational lensing and 
the kinetic SZ effect, have a frequency dependence that differs from that of the primary CMB 
anisotropies. This property can be exploited to remove these nonprimordial contaminants 
and lies at the heart of all component separation techniques. If there are N distinct components,
each of which is characterized by a fixed frequency spectrum, then in the absence of
measurement noise, a perfect separation of these components is possible using sky maps in N
frequency bands. With more bands, the separation is overdetermined, allowing validation of
the model. The component separation strategy sketched above, which may be described as a
linear method, is based on a linear model, but this is not the only way to clean maps. More
sophisticated nonlinear models may be formulated, often based on or inspired by Bayesian
statistics, that exploit other properties such as the positivity of certain components, spatial
variation of spectral indices, and so on. Some methods are characterized as being “blind,”
meaning that the number of components is indicated by the data itself, while other methods are
more physically based.
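
The linear method can be sketched as a least-squares problem: if the maps in each band are a known mixing matrix applied to the component amplitudes, then the amplitudes follow by solving the linear system pixel by pixel, and with more bands than components the residual provides a consistency check. The example below is a minimal noise-free toy. The band frequencies, the power-law scalings standing in for synchrotron-like and dust-like components, and the function name are all assumptions chosen for illustration, not calibrated spectra or the method of any particular experiment.

```python
import numpy as np

def separate_components(band_maps, mixing_matrix):
    """Least-squares solve of band_maps = mixing_matrix @ amplitudes, pixel by pixel.

    band_maps:     array of shape (n_bands, n_pix)
    mixing_matrix: array of shape (n_bands, n_components)
    Returns the amplitudes (n_components, n_pix) and the fit residual (n_bands, n_pix).
    """
    amplitudes, *_ = np.linalg.lstsq(mixing_matrix, band_maps, rcond=None)
    residual = band_maps - mixing_matrix @ amplitudes
    return amplitudes, residual

# Assumed toy setup: five bands, three components with fixed frequency spectra.
freqs_ghz = np.array([30.0, 44.0, 70.0, 100.0, 143.0])
ref_ghz = 70.0
mixing = np.column_stack([
    np.ones_like(freqs_ghz),            # CMB: same amplitude in every band
    (freqs_ghz / ref_ghz) ** -3.0,      # falling power law (synchrotron-like, illustrative)
    (freqs_ghz / ref_ghz) ** 1.6,       # rising power law (dust-like, illustrative)
])

rng = np.random.default_rng(1)
true_amplitudes = rng.standard_normal((3, 1000))   # 3 components, 1000 pixels
band_maps = mixing @ true_amplitudes               # noise-free simulated sky maps

recovered, residual = separate_components(band_maps, mixing)
print("max amplitude error:", np.max(np.abs(recovered - true_amplitudes)))
print("max residual:", np.max(np.abs(residual)))
```

With five bands and three components the system is overdetermined, so a residual significantly above the noise level in real data would signal that the assumed spectra or the number of components is inadequate.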

Because of limitations of space, we do not go into the details or practicalities of component 
separation, instead referring the reader to a few excellent reviews for more details. For an 
early discussion of the component separation problem and various approaches, see Refs. [?]
and [?]. Reference [?] includes a comparison of methods used for the Planck analysis.
Reference [56] describes foreground removal challenges for future CMB B mode polarization 
experiments and includes extensive references to the literature. 


23 Concluding Remarks 

Prior to the discovery of the CMB anisotropy in 1992 by the COBE team, cosmology was a
field with a questionable reputation. Many from other fields described cosmology as a field
where theorists were free to speculate with almost no observational or experimental constraints
to contradict them. Much was made of the supposed distinction between an “observation”
and an “experiment.”^w However, between 1992 and today cosmology has transformed into
a precision science where it has become increasingly difficult to propose new models without 
contradicting present observation. Ironically, much science having only a tenuous connection 
to cosmology is now being explained to the public as serving to probe the big bang. 

In this review, we have tried to sketch the most important aspects of the physics of the 
CMB in a self-contained way. Other reviews with a different emphasis include Refs. [?], [?]
and [?]. We have seen how the CMB provides a snapshot of the state of the universe at
around $z \approx 1100$, when the primordial perturbations were still very nearly linear, and how
the precise angular pattern on the sky is also sensitive to a number of other cosmological 
parameters, which can be determined in many cases to the percent level or better. We have 
also explored tests of non-Gaussianity. At present no non-Gaussianity of a primordial origin 
has been observed, ruling out many nonminimal models of inflation, in particular multi-field 
models developed to explain hints of non-Gaussianity in the WMAP data, which subsequently 
were excluded by Planck. 

We also saw how better measurements of the CMB absolute spectrum could reveal devi¬ 
ations from a perfect blackbody spectrum, some of which are to be expected and others of 
which would constitute signs of novel physics, such as energy injection at high redshift. The
absolute spectrum remains a promising frontier of CMB science. 

The other promising area of CMB research is the characterization of the polarized sky,
in particular the search for primordial B modes from inflation, which constitute the most
distinctive prediction of inflation. Maps of B modes on smaller angular scales would also allow
characterization of the gravitational lensing field at much greater precision than is possible
from observations of the temperature anisotropy. Such data would provide an important cross
check of weak lensing surveys using the ellipticities of galaxies and provide a unique probe of
absolute neutrino masses, perhaps allowing one to distinguish the normal from the inverted
mass hierarchy.

^w It is not clear whether proton-antiproton colliders would be classified as constituting an “experiment”
under this criterion, because the experimenter has virtually no control over parameters of the hard parton-parton
collisions of interest due to the broad spread in x of the parton distribution functions.

Since this volume is dedicated to Albert Einstein and the hundredth anniversary of his 
celebrated paper presenting the general theory of relativity, we close with a few remarks 
about Einstein, and in particular about his relation to the subject of this chapter. Einstein is 
remembered by today’s generation of physicists mostly for his role in the development of the 
special and general theories of relativity. This part of Einstein’s work sometimes overshadows 
Einstein’s numerous achievements in other areas and the extreme breadth of his scientific 
interests. Einstein is often viewed today as the paragon of the abstruse, highly mathematical 
theorist. Many physicists today would be surprised to learn that Einstein patented a design for 
an ammonia-butane refrigerator [59]. Others might be surprised to learn that Einstein is the
author of one experimental paper, for work after which the Einstein-de Haas effect is named.
This effect may be thought of as the converse of the Barnett effect, which we saw in Sec. 19.5.1
plays a crucial role in the alignment of interstellar dust grains. Moreover, the stochastic 
differential equations in that section bear a close relation to Einstein’s work on Brownian 
motion. General relativity was once regarded as an abstruse, highly mathematical subfield 
of physics. Within astrophysics, relativity was first introduced through the emergence of a 
subfield known as “relativistic astrophysics.” But nowadays general relativity pervades almost 
all areas of astrophysics and has become one of the elements of the underlying fundamental 
physics that every astrophysicist must know. One can no longer neatly separate “relativistic 
astrophysics” from the rest of astrophysics. 

This chapter told the story of big bang cosmology from a point of view emphasizing
the observations of the relic blackbody radiation. Einstein’s theory of general relativity, culminating
in his celebrated 1915 formulation of the gravitational field equations,^{x,y} enters as a
key ingredient to this story in two ways. First, there is the unperturbed spacetime solution, now
known as the FLRW solution. This family of spatially homogeneous and isotropic solutions
to the Einstein equations was first found by Alexandre Friedmann in 1922 and independently
by Georges Lemaître in 1927. At that time, the solutions were surprising because of their
prediction of a non-eternal dynamical universe whose natural state was either expansion or
contraction, an idea that was before its time. It was not until 1929 that Hubble discovered
that the recessional velocities of distant galaxies, except for some noise, vary approximately
linearly with distance (although in its first version the data was not statistically compelling).
Einstein’s general theory was deduced by drawing mainly on considerations of mathematical
simplicity and elegance and on the equivalence principle, but remarkably, applied to the
expanding universe through the FLRW solution and its linearized perturbations, general relativity
remains the basis of cosmology despite the great improvement in the data that has
taken place between the 1920s and now. Today there is much talk about “modified gravity,”
largely motivated by the discovery of the so-called “dark energy,” which no one yet really
understands. It can be argued whether the dark energy belongs on the left-hand side of the
Einstein field equation, in which case it would constitute a revision to Einstein’s theory, or
on the right-hand side, in which case it would simply constitute a new form of exotic stress-energy,
leaving the gravitational sector unchanged. But so far Einstein’s theory of relativity
as a classical field theory has survived without change. To be sure, we believe that gravitational
theory as formulated by Einstein is an incomplete story. Where quantum effects become
relevant, we know that there must be a more complete theory of which classical relativity is
but a limiting case. But we do not yet know what that larger theory is. In the classical
regime, Einstein’s theory has survived many stringent tests, and perhaps more importantly
has become an indispensable foundation for our current understanding of the universe and its
origins.

^x A. Einstein, “Die Feldgleichungen der Gravitation,” Sitzungsberichte der Königlich Preussischen Akademie
der Wissenschaften, p. 844 (25 November, 1915).

^y For an authoritative account from a scientific perspective of the path involving over a dozen papers and
several reversals that finally led Einstein to the gravitational field equations as we know them today (i.e.,
$G_{\mu\nu} = R_{\mu\nu} - \frac{1}{2} g_{\mu\nu} R = (8\pi G_N) T_{\mu\nu}$), see Abraham Pais, Subtle is the Lord (New York, Oxford University
Press, 1982).